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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT A Commonwealth Scientific and Industrial 

Research Organisation and 
Pacific Seeds Pty. Ltd. 



10 



(ia) INVENTORS: 



P. D. CHRISTIAN, K. H. J. GORDON and 
T. N . HANZLIK 



15 



(ii) TITLE OF INVENTION: 



(iii) NUMBER OF SEQUENCER 



INSECT VIRUSES AND THEIR USES IN 
PROTECTING PLANTS 

/ 



20 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIES COLLISON CAVE 

(B) STREET: 1 LITTLE tfOLLINS STREET 

(C) CITY: MELBOURNE 

(D) STATE : VICTORIA 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 



25 



30 



35 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy dis* 

(B) COMPUTER: IBM PC compati&le 

(C) OPERATING SYSTEM: PC-DOS/flfS-DOS 

(D) SOFTWARE: Patentln Release\# 1 . 0 , Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 13 AUGUST 1993 

(C) CLASSIFICATION: * 

(viii) ATTORNEY /AGENT INFORMATION: 
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(A) NAME: JOHN M . S LATTERY 

(B) REGISTRATION NUMBER: NA 

(C) REFERENCE/DOCKET NUMBER: 1613611 

5 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (613) 254 2777 




(2) INFORMATION FOR SEQ ID NO:l: 



{i} SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 
GGATCCACAG NNN 



(2) INFORMATION FOR SEQ ID NO:2: 

(i)~ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
ATGGGCGATG CCGGCGTCGC GTTCACAG 



{2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
ATGGAGGATG CTGGAGTGGC GTCACAG 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
AT GAGC GAGG CCGGCGTCGC GTCACAG 



(2) INFORMATION FOR SEQ ID NO: 5: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 {DJ TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

0 

CCATCGATGC CGGACTGGTA TCCCAGGGGG 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(AJ LENGTH: 31 base pairs 
(B) TYPE: nucleic acid 
(C} STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
CCATCGATGC CGGACTGGTA TCCCGAGGGA C 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
- ~ (A) -LENGTH: 39 -base -pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 



CC ATCGATGA TCCAGCCTCC TCGCGGCGCC GGATGGGCA 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
GGAGATC T AC ATATGGGAGA TGCTGGAGTG 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
GTAGCGAACG TCGAGAA 



(2) INFORMATION FOR SEQ ID NO: 19: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
GGGGGATCCT CAGTTGTCAG TGGC GGGGTA G 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GGGGATCCCT AATTGGCACG AGCGGCGC 



(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AATTACATAT GGCGGCCGCC GTTTCTGCC 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base pairs 
■ (B)_ TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
AATTACATAT GTTCGCGGCC GCCGTTTCT 



(2) INFORMATION FOR SEQ ID NO: 23: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein - N terminal 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



10 



15 



Phe Ala Ala Ala Val Ser Ala Phe Ala Ala Asn Met Leu Ser Ser Val 
15 10 1'5 



Leu Lys Ser 



20 (2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein - internal 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Pro Thr Leu Val Asp Gin Gly Phe Trp lie Gly Gly Gin Tyr Ala Leu 
1 5 10 15 

35 

Thr Pro Thr Ser 
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(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein - internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

Phe Ala Ala Ala Val Ser 

1 5 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
GCGCCCCCUG GGAUACCAGG AUC 



(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
TCAGCAGGTG GCATAGG 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
- _ ..(D). TOPOLOGY: linear. 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 6.. 32 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 



CCCAT ATG GGC GAT GCC GGC GTC GCG TCA CAG 
Met Gly Asp Ala Gly Val Ala Ser Gin 
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N 
Si 



(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein - N- terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Gly Asp Ala Gly Val Ala Ser Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 6 . . 32 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



CCCAT ATG AGC GAG GCC GGC GTC GCG TCA CAG 
Met Ser Glu Ala Gly Val Ala Ser Gin 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein - N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



Met Ser Glu Ala Gly Val Ala Ser Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2*7 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. . 27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
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ATG GGA GAT GCT GGA GTG GCG TCA CAG 
Met Gly Asp Ala Gly Val Ala Ser Gin 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 : 
GGGGGATCCG TTCTGCCTCC CCGGAC 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 37.. 51 4 5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 



GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACA ATG TAC GCG AAA GCG ACA 

Met Tyr Ala Lys Ala Thr 
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GAA GTC CAG AGG CGC CAC GGC TCC AGC ATT GAG CTG CGC ATC ACT CGC 1014 
Glu Val Gin Arg Arg His Gly Ser Ser lie Glu Leu Arg lie Thr Arg 
315 320 325 

5 GCG CCA CCT GGA GAC CGC ATG CTG GCC GTC GTC CCA AGG ACG TCC CAA 1062 

Ala Pro Pro Gly Asp Arg Met Leu Ala Val Val Pro Arg Thr Ser Gin 
330 335 340 

GGC CTC TGC AGA ATC CCA AAC ATC TTT TAT TAC GCC GAC GCG TCG GGC 1110 
10 Gly Leu Cys Arg lie Pro Asn lie Phe Tyr Tyr Ala Asp Ala Ser Gly 

345 350 355 

ACT GAG CAT AAG ACC ATC CTT ACG TCA CAG CAC AAA GTC AAC ATG CTG 1158 
Thr Glu His Lys Thr lie Leu Thr Ser Gin His Lys Val Asn Met Leu 
15 360 365 370 

CTC AAT TTT ATG CAA ACG CGT CCT GAG AAG GAA CTA GTC GAC ATG ACC 1206 

Leu Asn Phe Met Gin Thr Arg Pro Glu Lys Glu Leu Val Asp Met Thr 

375 380 385 390 

20 

GTC TTG ATG TCG TTC GCG CGC GCT AGG CTG CGC GCG ATC GTG GTC GCC 12 54 

Val Leu Met Ser_Phe Ala,Arg^ Ala Arg Leu Arg Ala lie Val Val Ala 

395 400 405 

25 TCA GAA GTC ACC GAG AGC TCC TGG AAC ATC TCA CCG GCT GAC CTG GTC 1302 

Ser Glu Val Thr Glu Ser Ser Trp Asn lie Ser Pro Ala Asp Leu Val 
410 .415 420 

CGC ACT GTC GTG TCT CTT TAC GTC CTC CAC ATC ATC GAG CGC CGA AGG 1350 
30 Arg Thr Val Val Ser Leu Tyr Val Leu His lie lie Glu Arg Arg Arg 

425 430 435 

GCT GCG GTC GCT GTC AAG ACC GCC AAG GAC GAC GTC TTT GGA GAG ACT 1398 
Ala Ala Val Ala Val Lys Thr Ala Lys Asp Asp Val Phe Gly Glu Thr 
35 440 445 450 
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TCG TTC TGG GAG AGT CTC AAG CAC GTC TTG GGC TCC TGT TGC GGT CTG 14 46 

Ser Phe Trp Glu Ser Leu Lys His Val Leu Gly Ser Cys Cys Gly Leu 
455 460 465 470 



CGC AAC CTC AAA GGC ACC GAC GTC GTC TTT ACT AAG CGC GTC GTC GAT 14 94 

Arg Asn Leu Lys Gly Thr Asp Val Val Phe Thr Lys Arg Val Val Asp 
475 480 485 



AAG TAC CGA GTC CAC TCG CTC GGA GAC ATA ATC TGC GAC GTC CGC CTG 
Lys Tyr Arg Val His Ser Leu Gly Asp lie lie Cys Asp Val Arg Leu 
490 495 500 



TCC CCT GAA CAG GTC GGC TTC CTG CCG TCC CGC GTA CCA CCT GCC CGC 15 90 

Ser Pro Glu Gin Val Gly Phe Leu Pro Ser Arg Val Pro Pro Ala Arg 
505 510 515 



GTC TTT CAC GAC AGG GAA GAG CTT GAG GTC CTT CGC GAA GCT GGC TGC 1638 
Val Phe His Asp Arg Glu Glu Leu Glu Val Leu Arg Glu Ala Gly Cys 
520 525 530 



TAC AAC GAA CGT CCG GTA CCT TCC ACT CCT CCT GTG GAG GAG CCC CAA 168 6 

Tyr Asn Glu Arg Pro Val Pro Ser Thr Pro Pro Val Glu Glu Pro Gin 
535 540 545 550 



GGT TTC GAC GCC GAC TTG TGG CAC GCG ACC GCG GCC TCA CTC CCC GAG 
Gly Phe Asp Ala Asp Leu Trp His Ala Thr Ala Ala Ser Leu Pro Glu 
555 560 565 



TAC CGC GCC ACC TTG CAG GCA GGT CTC AAC ACC GAC GTC AAG CAG CTC 
Tyr Arg Ala Thr Leu Gin Ala Gly Leu Asn Thr Asp Val Lys Gin Leu 
570 575 580 



AAG ATC ACC CTC GAG AAC GCC CTC AAG ACC ATC GAC GGG CTC ACC CTC 
Lys lie Thr Leu Glu Asn Ala Leu Lys Thr lie Asp Gly Leu Thr Leu 
585 590 595 
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TCC CCA GTC AGA GGC CTC GAG ATG TAC GAG GGC CCG CCA GGC AGC GGC 

Ser Pro Val Arg Gly Leu Glu Met Tyr Glu Gly Pro Pro Gly Ser Gly 
600 605 610 

AAG ACG GGC ACC CTC ATC GCC GCC CTT GAG GCC GCG GGC GGT AAA GCA 

Lys Thr Gly Thr Leu lie Ala Ala Leu Glu Ala Ala Gly Gly Lys Ala 

615 620 625 630 

CTT TAC GTG GCA CCC ACC AGA GAA CTG AGA GAG GCT ATG GAC CGG CGG 

Leu Tyr Val Ala Pro Thr Arg Glu Leu Arg Glu Ala Met Asp Arg Arg 
635 640 645 

ATC AAA CCG CCG TCC GCC TCG GCT ACG CAA CAT GTC GCC CTT GCG ATT 

lie Lys Pro Pro Ser Ala Ser Ala Thr Gin His Val Ala Leu Ala lie 
650 655 660 

CTC CGT CGT GCC ACC GCC GAG GGC GCC CCT TTC GCT ACC GTG GTT ATC 

Leu Arg Arg Ala Thr Ala Glu Gly Ala Pro Phe Ala Thr Val Val He 
665 670 675 

GAC GAG TGC TTC ATG TTC CCG CTC GTG TAC GTC GCG ATC GTG CAC GCC 

Asp Glu Cys Phe Met Phe Pro Leu Val Tyr Val Ala He Val His Ala 
680 685 690 

TTG TCC CCG AGC TCA CGA ATA GTC CTT GTA GGG GAC GTC CAC CAA ATC 

Leu Ser Pro Ser Ser Arg He Val Leu Val Gly Asp Val His Gin He 

695 700 705 710 

GGG TTT ATA GAC TTC CAA GGC ACA AGC GCG AAC ATG CCG CTC GTT CGC 

Gly Phe He Asp Phe Gin Gly Thr Ser Ala Asn Met Pro Leu Val Arg 
715 720 725 

GAC GTC GTT AAG CAG TGC CGT CGG CGC ACT TTC AAC CAA ACC AAG CGC 

Asp Val Val Lys Gin Cys Arg Arg Arg Thr Phe Asn Gin Thr Lys Arg 
730 735 740 
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TGT CCG GCC GAC GTC GTT GCC ACC ACG TTT TTC CAG AGC TTG TAC CCC 2310 

Cys Pro Ala Asp Val Val Ala Thr Thr Phe Phe Gin Ser Leu Tyr Pro 

745 750 755 

5 GGG TGC ACA ACC ACC TCA GGG TGC GTC GCA TCC ATC AGC CAC GTC GCC 2358 

Gly Cys Thr Thr Thr Ser Gly Cys Val Ala Ser lie Ser His Val Ala 

760 765 770 

CCA GAC TAC CGC AAC AGC CAG GCG CAA ACG CTC TGC TTC ACG CAG GAG 2 406 

10 Pro Asp Tyr Arg Asn Ser Gin Ala Gin Thr Leu Cys Phe Thr Gin Glu 

775 780 785 790 

GAA AAG TCG CGC CAC GGG GCT GAG GGC GCG ATG ACT GTG CAC GAA GCG 2 4 54 

Glu Lys Ser Arg His Gly Ala Glu Gly Ala Met Thr Val His Glu Ala 

15 795 800 805 

CAG GGA CGC ACT TTT GCG TCT GTC ATT CTG CAT TAC AAC GGC TCC ACA 2502 

Gin Gly Arg Thr Phe Ala Ser Val lie Leu His Tyr Asn Gly Ser Thr 
810 815 820 



20 



GCA GAG CAG AAG CTC CTC GCT GAG AAG TCG CAC CTT CTA GTC GGC ATC 
Ala Glu Gin Lys Leu Leu Ala Glu Lys Ser His Leu Leu Val Gly lie 
825 830 835 



25 
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ACG CGC CAC ACC AAC CAC CTG TAC ATC CGC GAC CCG ACA GGT GAC ATT 

Thr Arg His Thr Asn His Leu Tyr lie Arg Asp Pro Thr Gly Asp lie 
840 845 850 

GAG AGA CAA CTC AAC CAT AGC GCG AAA GCC GAG GTG TTT ACA GAC ATC 

Glu Arg Gin Leu Asn His Ser Ala Lys Ala Glu Val Phe Thr Asp lie 
855 860 865 870 

CCT GCA CCC CTG GAG ATC ACG ACT GTC AAA CCG AGT GAA GAG GTG CAG 

Pro Ala Pro Leu Glu lie Thr Thr Val Lys Pro Ser Glu Glu Val Gin 
875 880 885 

CGC AAC GAA GTG ATG GCA ACG ATA CCC CCG CAG AGT GCC ACG CCG CAC 

Arg Asn Glu Val Met Ala Thr lie Pro Pro Gin Ser Ala Thr Pro His 
890 895 900 

GGA GCA ATC CAT CTG CTC CGC AAG AAC TTC GGG GAC CAA CCC GAC TGT 

Gly Ala lie His Leu Leu Arg Lys Asn Phe Gly Asp Gin Pro Asp Cys 

905 910 915 

GGC TGT GTC GCT TTG GCG AAG ACC GGC TAC GAG GTG TTT GGC GGT CGT 

Gly Cys Val Ala Leu Ala Lys Thr Gly Tyr Glu Val Phe Gly Gly Arg 
920 925 930 
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GCC AAA ATC AAC GTA GAG CTT GCC GAA CCC GAC GCG ACC CCG AAG CCG 2 886 

Ala Lys lie Asn Val Glu Leu Ala Glu Pro Asp Ala Thr Pro Lys Pro 
935 940 945 950 

CAT AGG GCG TTC CAG GAA GGG GTA CAG TGG GTC AAG GTC ACC AAC GCG 2 934 

His Arg Ala Phe Gin Glu Gly Val Gin Trp Val Lys Val Thr Asn Ala 

955 960 965 

TCT AAC AAA CAC CAG GCG CTC CAG ACG CTG TTG TCC CGC TAC ACC AAG 2 982 

Ser Asn Lys His Gin Ala Leu Gin Thr Leu Leu Ser Arg Tyr Thr Lys 

970 975 980 



CGA AGC GCT GAC CTG CCG CTA CAC GAA GCT AAG GAG GAC GTC AAA CGC 3030 
15 Arg Ser Ala Asp Leu Pro Leu His Glu Ala Lys Glu Asp Val Lys Arg 

985 990 995 

ATG CTA AAC TCG CTT GAC CGA CAT TGG GAC TGG ACT GTC ACT GAA GAC 3078 
Met Leu Asn Ser Leu Asp Arg His Trp Asp Trp Thr Val Thr Glu Asp 
20 1000 1005 1010 

- GCC-CGT -GAC CGA GCT GTC- TTC GAG ACC CAG CTC AAG TTC ACC CAA CGC _ 3126 

Ala Arg Asp Arg Ala Val Phe Glu Thr Gin Leu Lys Phe Thr Gin Arg 
1015 1020 1025 1030 

25 

GGC GGC ACC GTC GAA GAC CTG CTG GAG CCA GAC GAC CCC TAC ATC CGT 317 4 

Gly Gly Thr Val Glu Asp Leu Leu Glu Pro Asp Asp Pro Tyr He Arg 
1035 1040 1045 

30 GAC ATA GAC TTC CTT ATG AAG ACT CAG CAG AAA GTG TCG CCC AAG CCG 3222 

Asp He Asp Phe Leu Met Lys Thr Gin Gin Lys Val Ser Pro Lys Pro 
1050 1055 1060 

ATC AAT ACG GGC AAG GTC GGG CAG GGG ATC GCC GCT CAC TCA AAG TCT 3270 
35 He ,Asn Thr Gly Lys Val Gly Gin Gly He Ala Ala His Ser Lys Ser 

1065 1070 1075 
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CTC AAC TTC GTC CTC GCC GCT TGG ATA CGC ATA CTC GAG GAG ATA CTC 3318 
Leu Asn Phe Val Leu Ala Ala Trp lie Arg lie Leu Glu Glu lie Leu 
1080 1085 1090 

5 CGT ACC GGG AGC CGC ACG GTC CGG TAC AGC AAC GGT CTC CCC GAC GAA 3366 

Arg Thr Gly Ser Arg Thr Val Arg Tyr Ser Asn Gly Leu Pro Asp Glu 
1095 1100 1105 1110 

GAA GAG GCC ATG CTG CTC GAA GCG AAG ATC AAT CAA GTC CCA CAC GCC 3414 
10 Glu Glu Ala Met Leu Leu Glu Ala Lys He Asn Gin Val Pro His Ala 

1115 1120 1125 

ACG TTC GTC TCG GCG GAC TGG ACC GAG TTT GAC ACC GCC CAC AAT AAC 34 62 

Thr Phe Val Ser Ala Asp Trp Thr Glu Phe Asp Thr Ala His Asn Asn 
15 1130 1135 1140 

ACG AGT GAG CTG CTC TTC GCC GCC CTT TTA GAG CGC ATC GGC ACG CCT 3510 

Thr Ser Glu Leu Leu Phe Ala Ala Leu Leu Glu Arg He Gly Thr Pro 

1145 1150 1155 

20 

GCA GCT GCC GTT AAT CTA TTC AGA GAA CGG TGT GGG AAA CGC ACC TTG 3558 

. _ .Ma Ala Ala- Val Asn Leu .Phe Arg Glu .Arg Cys_ Gly Lys Arg Thr Leu 

1160 1165 1170 

25 CGA GCG AAG GGT CTA GGC TCC GTT GAA GTC GAC GGT CTG CTC GAC TCC 3606 

Arg Ala Lys Gly Leu Gly Ser Val Glu Val Asp Gly Leu Leu Asp Ser 
1175 1180 1185 1190 

GGC GCA GCT TGG ACG CCT TGC CGC AAC ACC ATC TTC TCT GCC GCC GTC . 3654 
30 Gly Ala Ala Trp Thr Pro Cys Arg Asn Thr He Phe Ser Ala Ala Val 

1195 1200 1205 

ATG CTC ACG CTC TTC CGC GGC GTC AAG TTC GCA GCT TTC AAA GGC GAC 3702 
Met Leu Thr Leu Phe Arg Gly Val Lys Phe Ala Ala Phe Lys Gly Asp 
35 1210 1215 1220 
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GAC TCG CTC CTC TGT GGT AGC CAT TAC CTC CGT TTC GAC GCT AGC CGC 
Asp Ser Leu Leu Cys Gly Ser His Tyr Leu Arg Phe Asp Ala Ser Arg 
1225 1230 1235 



CTT CAC ATG GGC GAA CGT TAC AAG ACC AAA CAT TTG AAG GTC GAG GTG 
Leu His Met Gly Glu Arg Tyr Lys Thr Lys His Leu Lys Val Glu Val 
1240 1245 1250 

CAG AAA ATC GTG CCG TAC ATC GGA CTC CTC GTC TCC GCT GAG CAG GTC 
Gin Lys He Val Pro Tyr He Gly Leu Leu Val Ser Ala Glu Gin Val 
1255 1260 1265 1270 



GTC CTC GAC CCT GTC AGG AGC GCT CTC AAG ATA TTT GGG CGC TGC TAC 
Val Leu Asp Pro Val Arg Ser Ala Leu Lys He Phe Gly Arg Cys Tyr 
1275 1280 1285 



ACA AGC GAA CTC CTT TAC TCC AAG TAC GTG GAG GCT GTG AGA GAC ATC 
Thr Ser Glu Leu Leu Tyr Ser Lys Tyr Val Glu Ala Val Arg Asp He 
1290 1295 1300 



ACC AAG GGC TGG AGT GAC GCC CGC TAC CAC AGC CTC CTG TGC CAC ATG 
Thr Lys Gly Trp Ser Asp Ala Arg Tyr His Ser Leu Leu Cys His Met 
1305 1310 1315 



TCA GCA TGC TAC TAC AAT TAC GCG CCG GAG TCT GCG GCG TAC ATC ATC 
Ser Ala Cys Tyr Tyr Asn Tyr Ala Pro Glu Ser Ala Ala Tyr He He 
1320 1325 1330 



GAC GCT GTT GTT CGC TTT GGG CGC 
Asp Ala Val Val Arg Phe Gly Arg 
1335 1340 



GGC GAC TTC CCG TTT GAA CAA CTG 
Gly Asp Phe Pro Phe Glu Gin Leu 
1345 1350 



CGC GTG GTG CGT 
Arg Val Val Arg 



GCC CAT GTG CAG GCA CCC GAC GCT TAC AGC 
Ala His Val Gin Ala Pro Asp Ala Tyr Ser 
1355 1360 



AGC ACG 
Ser Thr 
1365 
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TAT CCG GCT AAC GTG CGC GCA TCG TGC CTT GAC CAC GTC TTC GAG CCC 4162 
Tyr Pro Ala Asn Val Arg Ala Ser Cys Leu Asp His Val Phe Glu Pro 
1370 1375 1380 

5 CGC CAG GCC GCC GCC CCG GCA GGT TTC GTT GCG ACA TGT GCG AAG CCG 42 30 

Arg Gin Ala Ala Ala Pro Ala Gly Phe Val Ala Thr Cys Ala Lys Pro 
1385 1390 1395 

GAA ACG CCT TCT TCA CTT ACC GCG AAA GCT GGT GTT TCT GCG ACT ACA 427 8 

10 Glu Thr Pro Ser Ser Leu Thr Ala Lys Ala Gly Val Ser Ala Thr Thr 

1400 1405 1410 

AGC CAC GTT GCG ACT GGG ACT GCG CCC CCG GAG TCT CCA TGG GAT GCA 4 326 

Ser His Val Ala Thr Gly Thr Ala Pro Pro Glu Ser Pro Trp Asp Ala 
15 1415 1420 1425 1430 

CCT GCA GCC AAC AGC TTT TCG GAG TTA TTG ACA CCG GAG ACC CCG TCC 4 37 4 

Pro Ala Ala Asn Ser Phe Ser Glu Leu Leu Thr Pro Glu Thr Pro Ser 
1435 1440 1445 

20 

ACA TCA TCC TCG CCG TCA TCG TCT TCA TCG GAC TCC TCT ACA TCG TGT 4 422 

Thr Ser Ser Ser Pro Ser Ser Ser Ser Ser Asp Ser Ser Thr Ser Cys 
1450 1455 1460 

25 GGA AGG TCG CTC AGT GGT GGA GAC ACC GCA AGG ACC ACA GAA GAC TTG 44 70 

Gly Arg Ser Leu Ser Gly Gly Asp Thr Ala Arg Thr Thr Glu Asp Leu 
1465 1470 1475 

AAC AGC AGA AAG CCG CCT TCG CAA GAC AGG CAA TCA CGC TCG TCT GAA 4 518 

30 Asn Ser Arg Lys Pro Pro Ser Gin Asp Arg Gin Ser Arg Ser Ser Glu 

1480 1485 1490 

TGT CTG GAC AGA AGC GGA GAA AGG ACA GGC AGT TCG TTA ACT GCC CCC 4 566 

Cys Leu Asp Arg Ser Gly Glu Arg Thr Gly Ser Ser Leu Thr Ala Pro 
35 1495 1500 1505 1510 
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ACT GCT CCG AGC CCC TCA TTC TCA TTT TCG GAA AGA GCT CGA CTG GCG 4 614 

Thr Ala Pro Ser Pro Ser Phe Ser Phe Ser Glu Arg Ala Arg Leu Ala 
1515 1520 1525 

5 ACC GGG CCG ACT GTC GCC GCT GCG ACA TCA CCT TCG GCA ACC CCA TCC 4662 

Thr Gly Pro Thr Val Ala Ala Ala Thr Ser Pro Ser Ala Thr Pro Ser 
1530 1535 1540 

TGC GCC ACG GAC CAG GTT GCC GCG AGG ACC ACG CCG GAC TTT GCG CCT 4 710 

10 Cys Ala Thr Asp Gin Val Ala Ala Arg Thr Thr Pro Asp Phe Ala Pro 

1545 1550 1555 

TTC CTG GGT TCC CAG TCT GCC CGT GCT GTC TCG AAG CCG TAC CGG CCC 47 58 

Phe Leu Gly Ser Gin Ser Ala Arg Ala Val Ser Lys Pro Tyr Arg Pro 
15 1560 1565 1570 

CCC ACG ACT GCC CGT TGG AAA GAA GTC ACC CCG CTC CAC GCG TGG AAG 4 806 

Pro Thr Thr Ala Arg Trp Lys Glu Val Thr Pro Leu His Ala Trp Lys 
1575 1580 1585 1590 

20 

GGC GTG ACC GGA GAC CGA CCG GAA GTC AGG GAG GAC CCG GAG ACA GCG 48 54 

_ Gly Val Thr Gly Asp Arg Pro Glu Val Arg Glu Asp Pro Glu Thr Ala 
1595 1600 1605 

25 GCG GTC GTC CAG GCT CTG ATC AGC GGC CGT TAT CCT CAG AAG ACG AAG 4 902 

Ala Val Val Gin Ala Leu lie Ser Gly Arg Tyr Pro Gin Lys Thr Lys 
1610 1615 1620 

CTT TCC TCC GAC GCA TCC AAA GGC TAC TCA AGA ACT AAG GGA TGC TCA 4 950 

30 Leu Ser Ser Asp Ala Ser Lys Gly Tyr Ser Arg Thr Lys Gly Cys Ser 

1625 1630 1635 

CAA TCC ACC TCT TTT CCT GCC CCG AGT GCG GAT TAC CAG GCC CGC GAC 4 998 

Gin Ser Thr Ser Phe Pro Ala Pro Ser Ala Asp Tyr Gin Ala Arg Asp 
35 1640 1645 1650 
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TGC CAG ACA GTC CGA GTC TGC CGC GCC GCT GCA GAG ATG GCG CGC TCA 504 6 

Cys Gin Thr Val Arg Val Cys Arg Ala Ala Ala Glu Met Ala Arg Ser 
1655 1660 1665 1670 

5 TGT ATT CAC GAG CCG TTG GCT TCA TCT GCC GCC AGT GCC GAC TTG AAG 5094 

Cys lie His Glu Pro Leu Ala Ser Ser Ala Ala Ser Ala Asp Leu Lys 
1675 1680 1685 

CGC ATA CGC TCT ACC TCG GAC TCT GTT CCC GAT GTA AAG ATC AGC AAG 5142 
10 Arg lie Arg Ser Thr Ser Asp Ser Val Pro Asp Val Lys lie Ser Lys 

1690 1695 1700 
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AGC GCA TGAAGGAACA AAATTAGTTT CCTTGTTCGT AAACAAGGTG GTCCCTCCCA 5198 
Ser Ala 

5 

T TGAGGT AAA GACTCTGGTG AGTCCTCAAC GTTACTCGTT GAGTCTGCTG CGGTTCGATT 5258 



CCATTCCCAA GCAGCAAAGG GTGCGCAACT AGTACGGCGC CCCCTGGGAT ACCA 5312 

10 



(2) INFORMATION FOR SEQ ID NO: 40: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1704 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



Met Tyr Ala Lys Ala Thr Asp Val Ala Arg Val Tyr Ala Ala Ala Asp 
25 1 5 10 15 



Val Ala Tyr Ala Asn Val Leu Gin Gin Arg Ala Val Lys Leu Asp Phe 
20 25 30 

30 Ala Pro Pro Leu Lys Ala Leu Glu Thr Leu His Arg Leu Tyr Tyr Pro 

35 40 45 

Leu Arg Phe Lys Gly Gly Thr Leu Pro Pro Thr Gin His Pro lie Leu 
50 55 60 

35 

Ala Gly His Gin Arg Val Ala Glu Glu Val Leu His Asn Phe Ala Arg 
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65 



Gly Arg Ser Thr Val Leu Glu lie Gly Pro Ser Leu His Ser Ala Leu 
85 90 95 

Lys Leu His Gly Ala Pro Asn Ala Pro Val Ala Asp Tyr His Gly Cys 
100 105 110 

Thr Lys Tyr Gly Thr Arg Asp Gly Ser Arg His lie Thr Ala Leu Glu 
115 120 125 

Ser Arg Ser Val Ala Thr Gly Arg Pro Glu Phe Lys Ala Asp Ala Ser 
130 135 140 

Leu Leu Ala Asn Gly lie Ala Ser Arg Thr Phe Cys Val Asp Gly Val 
145 150 155 160 

Gly Ser Cys Ala Phe Lys Ser Arg Val Gly lie Ala Asn His Ser Leu 
165 170 175 

Tyr Asp Val Thr Leu Glu Glu Leu Ala Asn Ala Phe Glu Asn His Gly 
180 185 190 

Leu His Met Val Arg Ala Phe Met His Met Pro Glu Glu Leu Leu Tyr 
195 200 205 

Met Asp Asn Val Val Asn Ala Glu Leu Gly Tyr Arg Phe His Val lie 
210 215 220 

Glu Glu Pro Met Ala Val Lys Asp Cys Ala Phe Gin Gly Gly Asp Leu 
225 230 235 240 

Arg Leu His Phe Pro Glu Leu Asp Phe lie Asn Glu Ser Gin Glu Arg 
245 250 255 

Arg lie Glu Arg Leu Ala Ala Arg Gly Ser Tyr Ser Arg Arg Ala Val 
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He Phe Ser Gly Asp Asp Asp Trp Gly Asp Ala Tyr Leu His Asp Phe 
275 280 285 

His Thr Trp Leu Ala Tyr Leu Leu Val Arg Asn Tyr Pro Thr Pro Phe 
290 295 300 

Gly Phe Ser Leu His He Glu Val Gin Arg Arg His Gly Ser Ser He 
305 310 315 320 

Glu Leu Arg He Thr Arg Ala Pro Pro Gly Asp Arg Met Leu Ala Val 
325 330 335 

Val Pro Arg Thr Ser Gin Gly Leu Cys Arg He Pro Asn He Phe Tyr 
340 345 350 

Tyr Ala Asp Ala Ser Gly Thr Glu His Lys Thr He Leu Thr Ser Gin 
355 360 365 

His Lys Val Asn Met Leu Leu Asn Phe Met Gin Thr Arg Pro Glu Lys 

. 370 ... 375 380 

Glu Leu Val Asp Met Thr Val Leu Met Ser Phe Ala Arg Ala Arg Leu 
385 390 395 400 

Arg Ala He Val Val Ala Ser Glu Val Thr Glu Ser Ser Trp Asn He 
405 410 415 

Ser Pro Ala Asp Leu Val Arg Thr Val Val Ser Leu Tyr Val Leu His 
420 425 430 

He He Glu Arg Arg Arg Ala Ala Val Ala Val Lys Thr Ala Lys Asp 
435 440 445 

Asp Val Phe Gly Glu Thr Ser Phe Trp Glu Ser Leu Lys His Val Leu 
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450 



Gly Ser Cys Cys Gly Leu Arg Asn Leu Lys Gly Thr Asp Val Val Phe 
465 470 475 480 

Thr Lys Arg Val Val Asp Lys Tyr Arg Val His Ser Leu Gly Asp He 
485 490 495 

He Cys Asp Val Arg Leu Ser Pro Glu Gin Val Gly Phe Leu Pro Ser 
500 505 510 

Arg Val Pro Pro Ala Arg Val Phe His Asp Arg Glu Glu Leu Glu Val 
515 520 525 

Leu Arg Glu Ala Gly Cys Tyr Asn Glu Arg Pro Val Pro Ser Thr Pro 
530 535 540 

Pro Val Glu Glu Pro Gin Gly Phe Asp Ala Asp Leu Trp His Ala Thr 
545 550 555 560 

Ala Ala Ser Leu Pro Glu Tyr Arg Ala Thr Leu Gin Ala Gly Leu Asn 

- - 565 - 570 . 575 

Thr Asp Val Lys Gin Leu Lys He Thr Leu Glu Asn Ala Leu Lys Thr 
580 585 590 

He Asp Gly Leu Thr Leu Ser Pro Val Arg Gly Leu Glu Met Tyr Glu 
595 600 605 

Gly Pro Pro Gly Ser Gly Lys Thr Gly Thr Leu He Ala Ala Leu Glu 
610 615 620 

Ala Ala Gly Gly Lys Ala Leu Tyr Val Ala Pro Thr Arg Glu Leu Arg 
625 630 635 640 

Glu Ala Met Asp Arg Arg He Lys Pro Pro Ser Ala Ser Ala Thr Gin 
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645 650 655 

His Val Ala Leu Ala lie Leu Arg Arg Ala Thr Ala Glu Gly Ala Pro 

660 665 670 

Phe Ala Thr Val Val lie Asp Glu Cys Phe Met Phe Pro Leu Val Tyr 

675 680 685 

Val Ala He Val His Ala Leu Ser Pro Ser Ser Arg He Val Leu Val 

690 695 700 

Gly Asp Val His Gin He Gly Phe He Asp Phe Gin Gly Thr Ser Ala 

705 710 715 720 

Asn Met Pro Leu Val Arg Asp Val Val Lys Gin Cys Arg Arg Arg Thr 

725 730 735 

Phe Asn Gin Thr Lys Arg Cys Pro Ala Asp Val Val Ala Thr Thr Phe 

740 745 750 

Phe Gin Ser Leu Tyr Pro Gly Cys Thr Thr Thr Ser Gly Cys Val Ala 

755 . .760 . _ 765 

Ser He Ser His Val Ala Pro Asp Tyr Arg Asn Ser Gin Ala Gin Thr 

770 775 780 

Leu Cys Phe Thr Gin Glu Glu Lys Ser Arg His Gly Ala Glu Gly Ala 

785 790 795 800 

Met Thr Val His Glu Ala Gin Gly Arg Thr Phe Ala Ser Val He Leu 

805 810 815 

His Tyr Asn Gly Ser Thr Ala Glu Gin Lys Leu Leu Ala Glu Lys Ser 

820 825 830 

His Leu Leu Val Gly He Thr Arg His Thr Asn His Leu Tyr He Arg 
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835 840 845 

Asp Pro Thr Gly Asp lie Glu Arg Gin Leu Asn His Ser Ala Lys Ala 

850 855 860 

Glu Val Phe Thr Asp lie Pro Ala Pro Leu Glu lie Thr Thr Val Lys 

865 870 875 880 

Pro Ser Glu Glu Val Gin Arg Asn Glu Val Met Ala Thr He Pro Pro 

885 890 895 

Gin Ser Ala Thr Pro His Gly Ala He His Leu Leu Arg Lys Asn Phe 

900 905 910 

Gly Asp Gin Pro Asp Cys Gly Cys Val Ala Leu Ala Lys Thr Gly Tyr 

915 920 925 

Glu Val Phe Gly Gly Arg Ala Lys He Asn Val Glu Leu Ala Glu Pro 

930 935 940 

Asp Ala Thr Pro Lys Pro His Arg Ala Phe Gin Glu Gly Val Gin Trp 

94 5 950 955 960 

Val Lys Val Thr Asn Ala Ser Asn Lys His Gin Ala Leu Gin Thr Leu 

965 970 975 

Leu Ser Arg Tyr Thr Lys Arg Ser Ala Asp Leu Pro Leu His Glu Ala 

980 985 990 

Lys Glu Asp Val Lys Arg Met Leu Asn Ser Leu Asp Arg His Trp Asp 

995 1000 1005 

Trp Thr Val Thr Glu Asp Ala Arg Asp Arg Ala Val Phe Glu Thr Gin 

1010 1015 1020 . '"' 

Leu Lys Phe Thr Gin Arg Gly Gly Thr Val Glu Asp Leu Leu Glu Pro 
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1035 



Asp Asp Pro Tyr lie Arg Asp lie Asp Phe Leu Met Lys Thr Gin Gin 
1045 1050 1055 

Lys Val Ser Pro Lys Pro lie Asn Thr Gly Lys Val Gly Gin Gly lie 
1060 1065 1070 

Ala Ala His Ser Lys Ser Leu Asn Phe Val Leu Ala Ala Trp lie Arg 
1075 1080 1085 

lie Leu Glu Glu lie Leu Arg Thr Gly Ser Arg Thr Val Arg Tyr Ser 
1090 1095 1100 

Asn Gly Leu Pro Asp Glu Glu Glu Ala Met Leu Leu Glu Ala Lys lie 
1105 1110 1115 1120 

Asn Gin Val Pro His Ala Thr Phe Val Ser Ala Asp Trp Thr Glu Phe 
1125 1130 1135 

Asp Thr Ala His Asn Asn Thr Ser Glu Leu Leu Phe Ala Ala Leu Leu 
1140 1145 1150 

Glu Arg lie Gly Thr Pro Ala Ala Ala Val Asn Leu Phe Arg Glu Arg 
1155 1160 1165 

Cys Gly Lys Arg Thr Leu Arg Ala Lys Gly Leu Gly Ser Val Glu Val 
1170 1175 1180 

Asp Gly Leu Leu Asp Ser Gly Ala Ala Trp Thr Pro Cys Arg Asn Thr 
1185 1190 1195 1200 

lie Phe Ser Ala Ala Val Met Leu Thr Leu Phe Arg Gly Val Lys Phe 
1205 1210 ' 1215 

Ala Ala Phe Lys Gly Asp Asp Ser Leu Leu Cys Gly Ser His Tyr Leu 
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1230 
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Arg Phe Asp Ala Ser Arg Leu His Met Gly Glu Arg Tyr Lys Thr Lys 
1235 1240 1245 

His Leu Lys Val Glu Val Gin Lys He Val Pro Tyr He Gly Leu Leu 
1250 1255 1260 

Val Ser Ala Glu Gin Val Val Leu Asp Pro Val Arg Ser Ala Leu Lys 
1265 1270 1275 1280 

He Phe Gly Arg Cys Tyr Thr Ser Glu Leu Leu Tyr Ser Lys Tyr Val 
1285 1290 1295 

Glu Ala Val Arg Asp He Thr Lys Gly Trp Ser Asp Ala Arg Tyr His 
1300 1305 1310 

Ser Leu Leu Cys His Met Ser Ala Cys Tyr Tyr Asn Tyr Ala Pro Glu 
1315 1320 1325 

Ser Ala Ala Tyr He He Asp Ala Val Val Arg Phe Gly Arg Gly Asp 
1330 1335 1340 

Phe Pro Phe Glu Gin Leu Arg Val Val Arg Ala His Val Gin Ala Pro 
1345 1350 1355 1360 

Asp Ala Tyr Ser Ser Thr Tyr Pro Ala Asn Val Arg Ala Ser Cys Leu 
1365 1370 1375 

Asp His Val Phe Glu Pro Arg Gin Ala Ala Ala Pro Ala Gly Phe Val 
1380 1385 1390 

Ala Thr Cys Ala Lys Pro Glu Thr Pro Ser Ser Leu Thr Ala Lys Ala 
1395 1400 1405 

Gly Val Ser Ala Thr Thr Ser His Val Ala Thr Gly Thr Ala Pro Pro 
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Glu Ser Pro Trp Asp Ala Pro Ala Ala Asn Ser Phe Ser Glu Leu Leu 
1425 1430 1435 1440 

Thr Pro Glu Thr Pro Ser Thr Ser Ser Ser Pro Ser Ser Ser Ser Ser 
1445 1450 1455 

Asp Ser Ser Thr Ser Cys Gly Arg Ser Leu Ser Gly Gly Asp Thr Ala 
1460 1465 1470 

Arg Thr Thr Glu Asp Leu Asn Ser Arg Lys Pro Pro Ser Gin Asp Arg 
1475 1480 1485 

Gin Ser Arg Ser Ser Glu Cys Leu Asp Arg Ser Gly Glu Arg Thr Gly 
1490 1495 1500 

Ser Ser Leu Thr Ala Pro Thr Ala Pro Ser Pro Ser Phe Ser Phe Ser 
1505 1510 1515 1520 

Glu Arg Ala Arg Leu Ala Thr Gly Pro Thr Val Ala Ala Ala Thr Ser 
1525 1530 1535 

- Pro- Ser -Ala Thr Pro Ser. Cys Ala Thr Asp Gin Val Ala Ala Arg Thr 
1540 1545 1550 

Thr Pro Asp Phe Ala Pro Phe Leu Gly Ser Gin Ser Ala Arg Ala Val 
1555 1560 1565 

Ser Lys Pro Tyr Arg Pro Pro Thr Thr Ala Arg Trp Lys Glu Val Thr 
1570 1575 1580 

Pro Leu His Ala Trp Lys Gly Val Thr Gly Asp Arg Pro Glu Val Arg 
1585 1590 1595 1600 



Glu Asp Pro Glu Thr Ala Ala Val Val Gin Ala Leu lie Ser Gly Arg 
1605 1610 1615 
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Tyr Pro Gin Lys Thr Lys Leu Ser Ser Asp Ala Ser Lys Gly Tyr Ser 
1620 1625 1630 



Arg Thr Lys Gly Cys Ser Gin Ser Thr Ser Phe Pro Ala Pro Ser Ala 
1635 1640 1645 



Asp Tyr Gin Ala Arg Asp Cys Gin Thr Val Arg Val Cys Arg Ala Ala 
1650 1655 1660 



Ala Glu Met Ala Arg Ser Cys lie His Glu Pro Leu Ala Ser Ser Ala 
1665 1670 1675 1680 
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Ala Ser Ala Asp Leu Lys Arg lie Arg Ser Thr Ser Asp Ser Val Pro 
1685 1690 1695 

Asp Val Lys lie Ser Lys Ser Ala 
1700 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 4218.. 4512 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACAATGT ACGCGAAAGC GACAGACGTG 
GCGCGTGTCT ACGCCGCGGC AGAT GTCGCC TACGCGAACG TACTGCAGCA GAGAGCAGTC 
AAGTTGGACT TCGCCCCGCC ACTGAAGGCA CTAGAAACCC TCCACAGACT GTACTATCCG 
CTGCGCTTCA AAGGGGGCAC TTTACCCCCG ACACAACACC CGATCCTGGC CGGGCACCAA 
CGTGTCGCAG AAGAGGTTCT GCACAATTTC GCCAGGGGAC GTAGCACAGT GCTCGAGATA 
GGGCCGTCTC TGCACAGCGC ACTTAAGCTA CATGGGGCAC CGAACGCCCC CGTCGCAGAC 
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TATCACGGGT GCACCAAGTA CGGCACCCGC GACGGCTCGC GAC AC AT T AC GGCCTTAGAG 
TCTAGATCCG TCGCCACAGG CCGGCCCGAG TTCAAGGCCG ACGCCTCACT GCTCGCCAAC 
GGCATTGCCT CCCGCACCTT CTGCGTCGAC GGAGTCGGCT CTTGCGCGTT CAAATCGCGC 
GTTGGAATTG CCAATCACTC CCTCTATGAC GTGACCCTAG AGGAGCTGGC CAATGCGTTT 
GAGAACCACG GACTTCACAT GGTCCGCGCG TTCATGCACA TGCCAGAAGA GCTGCTCTAC 
ATGGACAACG TGGTTAATGC CGAGCTCGGC TACCGCTTCC ACGTTATTGA AGAGCCTATG 
GCTGTGAAGG ACTGCGCATT CCAGGGGGGG GACCTCCGTC TCCACTTCCC TGAGTTGGAC 
TTCATCAACG AGAGCCAAGA GCGGCGCATC GAGAGGCTGG CCGCCCGCGG CTCCTACTCC 
AGACGCGCCG TCATTTTCTC CGGCGACGAC GACTGGGGTG ATGCGTACTT AC AC GACTTC 
CACACATGGC TCGCCTACCT ACTGGTGAGG AACTACCCCA CTCCGTTTGG TTTCTCACTC 
CATATAGAAG TCCAGAGGCG CCACGGCTCC AGCATTGAGC TGCGCATCAC TCGCGCGCCA 
CCTGGAGACC GCATGCTGGC CGTCGTCCCA AGGACGTCCC AAGGCCTCTG CAGAATCCCA 
AACATCTTTT AT T ACGCCGA CGCGTCGGGC ACTGAGCATA AGACCATCCT TACGTCACAG 
CACAAAGTCA ACATGCTGCT CAATTTTATG CAAACGCGTC CTGAGAAGGA AC T AGTC GAC 
ATGACCGTCT TGATGTCGTT CGCGCGCGCT AGGCTGCGCG CGATCGTGGT CGCCTCAGAA 
GTCACCGAGA GCTCCTGGAA CATCTCACCG GCTGACCTGG TCCGCACTGT CGTGTCTCTT 
TACGTCCTCC ACATCATCGA GC GCC GAAGG GCTGCGGTCG CTGTCAAGAC CGCCAAGGAC 
GACGTCTTTG GAGAGACTTC GTTCTGGGAG AGTCTCAAGC ACGTCTTGGG CTCCTGTTGC 
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GGTCTGCGCA ACCTCAAAGG CACCGACGTC GTCTTTACTA AGCGCGTCGT CGATAAGTAC 1500 

CGAGTCC AC T CGCTCGGAGA CATAATCTGC GACGTCCGCC TGTCCCCTGA ACAGGTCGGC 1560 

5 TTCCTGCCGT CCCGCGTACC ACCTGCCCGC GTCTTTCACG ACAGGGAAGA GCTTGAGGTC 1620 

CTTCGCGAAG CTGGCTGCTA CAACGAACGT CCGGTACCTT CCACTCCTCC TGTGGAGGAG 1680 

CCCCAAGGTT TCGACGCCGA CTTGTGGCAC GC GACC GCGG CCTCACTCCC CGAGTACCGC 1740 

10 

GCCACCTTGC AGGCAGGTCT CAACACCGAC GTCAAGCAGC TCAAGATCAC CCTCGAGAAC 1800 
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GCCCTCAAGA CCATCGACGG GCTCACCCTC TCCCCAGTCA GAGGCCTCGA GAT GT AC GAG 
GGCCCGCCAG GCAGCGGCAA GACGGGCACC CTCATCGCCG CCCTTGAGGC CGCGGGCGGT 
AAAGC AC TTT ACGTGGCACC CACCAGAGAA CTGAGAGAGG CTATGGACCG GC GGATC AAA 
CCGCCGTCCG CCTCGGCTAC GCAACAT GTC GCCCTTGCGA TTCTCCGTCG TGCCACCGCC 
GAGGGC GCCC CTTTCGCTAC CGTGGTTATC GACGAGTGCT TCATGTTCCC GCTCGTGTAC 
GTCGCGATCG TGCACGCCTT GTCCCCGAGC TCACGAATAG TCCTTGTAGG GGACGTCCAC 
CAAATCGGGT TTATAGACTT CCAAGGCACA AGC GCGAAC A TGCCGCTCGT TCGCGACGTC 
GTTAAGCAGT GCCGTCGGCG CACTTTCAAC CAAACCAAGC GCTGTCCGGC CGACGTCGTT 
GCCACCACGT TTTTCCAGAG CTTGTACCCC GGGTGCACAA CCACCTCAGG GTGCGTCGCA 
TCCATCAGCC ACGTCGCCCC AGACTACCGC AACAGCCAGG CGCAAACGCT CTGCTTCACG 
CAGGAGGAAA AGTCGCGCCA CGGGGCTGAG GGCGCGATGA CTGTGCACGA AGCGCAGGGA 
CGCACTTTTG CGTCTGTCAT TCTGCAT T AC AACGGCTCCA CAGCAGAGCA GAAGCTCCTC 
GCTGAGAAGT CGCACCTTCT AGTCGGCATC ACGCGCCACA CCAACCACCT GTACATCCGC 
GACCCGACAG GTGACATTGA GAGACAACTC AACCAT AGC G CGAAAGCCGA GGTGTTTACA 
GACATCCCTG CACCCCTGGA GATCACGACT GTCAAACCGA GT GAAGAGGT GC AGC GC AAC 
GAAGTGATGG CAACGATACC CCCGCAGAGT GCCACGCCGC AC GGAGCAAT CCATCTGCTC 
CGCAAGAACT TCGGGGACCA ACCCGACTGT GGCTGTGTCG CTTTGGCGAA GACCGGC TAC 
GAGGTGTTTG GCGGTCGTGC CAAAATCAAC GTAGAGCTTG CCGAACCCGA CGCGACCCCG 
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AAGCCGCATA GGGCGTTCCA GGAAGGGGTA CAGTGGGTCA AGGTCACCAA CGCGTCTAAC 2 94 0 



AAACACCAGG CGCTCCAGAC GCTGTTGTCC CGCTACACCA AGCGAAGCGC TGACCTGCCG 3000 



5 C T AC AC GAAG CTAAGGAGGA CGTCAAACGC ATGCTAAACT CGCTTGACCG ACATTGGGAC 3060 



TGGACTGTCA CTGAAGACGC CCGTGACCGA GCTGTCTTCG AGACCCAGCT CAAGTTCACC 3120 



10 



20 



30 



CAACGCGGCG GCACCGTCGA AGACCTGCTG GAGCCAGACG ACCCCTACAT CCGTGACATA 3180 



GACTTCCTTA TGAAGAC TC A GCAGAAAGTG TCGCCCAAGC CGATCAATAC GGGCAAGGTC 3240 



GGGCAGGGGA TCGCCGCTCA CTCAAAGTCT CTCAACTTCG TCCTCGCCGC TTGGATACGC 3300 
15 ATACTCGAGG AGATACTCCG TACCGGGAGC CGCACGGTCC GGTACAGCAA CGGTCTCCCC 3360 



GACGAAGAAG AGGCCATGCT GCTCGAAGCG AAGATCAATC AAGTCCCACA CGCCACGTTC 3 420 



GTCTCGGCGG ACTGGACCGA GTTTGACACC GCCCACAATA AC AC GAGTGA GCTGCTCTTC 3480 



GCCGCCCTTT TAGAGCGCAT CGGCACGCCT GCAGCTGCCG TTAATCTATT CAGAGAACGG 3 540 



TGTGGGAAAC GCACCTTGCG AGCGAAGGGT CTAGGCTCCG TTGAAGTCGA CGGTCTGCTC 3600 



25 GACTCCGGCG CAGCTTGGAC GCCTTGCCGC AACACCATCT TCTCT GCCGC CGTCATGCTC 3660 



ACGCTCTTCC GCGGCGTCAA GTTCGCAGCT TTCAAAGGCG ACGACTCGCT CCTCTGTGGT 3720 



AGCCATTACC TCCGTTTCGA CGCTAGCCGC CTTCACATGG GCGAACGTTA CAAGACCAAA 37 80 



CATTTGAAGG TCGAGGTGCA GAAAATCGTG CCGTACATCG GACTCCTCGT CTCCGCTGAG 3840 



CAGGTCGTCC TCGACCCTGT CAGGAGCGCT CTCAAGATAT TTGGGCGCTG CTACACAAGC 3900 



35 GAACTCCTTT AC TCC AAGT A CGTGGAGGCT GT GAGAGAC A TCACCAAGGG CTGGAGTGAC 3960 
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GCCCGCTACC ACAGCCTCCT GTGCCACATG TCAGCATGCT ACTACAATTA CGCGCCGGAG 4020 

TCTGCGGCGT ACATCATCGA CGCTGTTGTT CGCTTTGGGC GCGGCGACTT CCCGTTTGAA 4080 

5 CAACTGCGCG TGGTGCGTGC CCATGTGCAG GCACCCGACG CTTACAGCAG CACGTATCCG 4140 

GCTAACGTGC GCGCATCGTG CCTTGACCAC GTCTTCGAGC CCCGCCAGGC CGCCGCCCCG 4200 

GCAGGTTTCG TTGCGAC ATG TGC GAA GCC GGA AAC GCC TTC TTC ACT TAC 42 50 
10 Met Cys Glu Ala Gly Asn Ala Phe Phe Thr Tyr 

15 10 
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CGC GAA AGC TGG TGT TTC TGC GAC TAC AAG CCA CGT TGC GAC TGG GAC 42 98 

Arg Glu Ser Trp Cys Phe Cys Asp Tyr Lys Pro Arg Cys Asp Trp Asp 
15 20 25 

5 TGC GCC CCC GGA GTC TCC ATG GGA TGC ACC TGC AGC CAA CAG CTT TTC 4 34 6 

Cys Ala Pro Gly Val Ser Met Gly Cys Thr Cys Ser Gin Gin Leu Phe 
30 35 40 

GGA GTT ATT GAC ACC GGA GAC CCC GTC CAC ATC ATC" CTC GCC GTC ATC 4 394 

10 Gly Val lie Asp Thr Gly Asp Pro Val His lie lie Leu Ala Val lie 

45 50 55 

GTC TTC ATC GGA CTC CTC TAC ATC GTG TGG AAG GTC GCT CAG TGG TGG 4 442 

Val Phe lie Gly Leu Leu Tyr lie Val Trp Lys Val Ala Gin Trp Trp 
15 60 65 70 75 

AGA CAC CGC AAG GAC CAC AGA AGA CTT GAA CAG CAG AAA GCC GCC TTC 4 4 90 
Arg His Arg Lys Asp His Arg Arg Leu Glu Gin Gin Lys Ala Ala Phe 
80 85 90 

20 

GCA AGA CAG GCA ATC ACG CTC GTC TGAATGTC TGGACAGAAG CGGAGAAAGG 4 54 2 
Ala. Arg Gin Ala He Thr Leu Val 
95 

25 ACAGGCAGTT CGTTAACTGC CCCCACTGCT CCGAGCCCCT CATTCTCATT TTCGGAAAGA 4 602 

GCTCGACTGG CGACCGGGCC GACTGTCGCC GCTGC GACAT CACCTTCGGC AACCCCATCC 4 662 

TGC GCC AC GG ACCAGGTTGC CGCGAGGACC ACGCCGGACT TTGCGCCTTT CCTGGGTTCC 4 7 22 

30 

CAGTCTGCCC GTGCTGTCTC GAAGCCGTAC CGGCCCCCCA CGACTGCCCG TTGGAAAGAA 4 7 82 

GTCACCCCGC TCC AC GC GTG GAAGGGCGTG ACC GGAGACC GACCGGAAGT CAGGGAGGAC 4842 



35 CCGGAGACAG CGGCGGTCGT CCAGGCTCTG ATCAGCGGCC GTTATCCTCA GAAGAC GAAG 4 902 
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CTTTCCTCCG ACGCATCCAA AGGCTACTCA AGAACTAAGG GATGCTCACA ATCCACCTCT 4 962 

TTTCCTGCCC CGAGTGCGGA TTACCAGGCC CGCGACTGCC AGACAGTCCG AGTCTGCCGC 5022 

5 GCCGCTGCAG AGATGGCGCG CTCATGTATT C AC GAGCCGT TGGCTTCATC TGCCGCCAGT 5082 

GCCGACTTGA AGCGCATACG CTCTACCTCG GACTCTGTTC CCGATGTAAA GATCAGCAAG 5142 

AGCGCATGAA GGAACAAAAT TAGTTTCCTT GTTCGTAAAC AAGGTGGTCC CTCCCATTGA 52 02 

10 

GGTAAAGACT CTGGTGAGTC CTCAACGTTA CTCGTTGAGT CTGCTGCGGT TCGATTCCAT 5262 

TCCCAAGCAG CAAAGGGTGC GCAACTAGTA CGGCGCCCCC TGGGATACCA 5312 



15 
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(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



Met Cys Glu Ala Gly Asn Ala Phe Phe Thr Tyr Arg Glu Ser Trp Cys 

15 10 15 

Phe Cys Asp Tyr Lys Pro Arg Cys Asp Trp Asp Cys Ala Pro Gly Val 

20 25 30 

Ser Met Gly Cys Thr Cys Ser Gin Gin Leu Phe Gly Val lie Asp Thr 

35 40 45 

Gly Asp Pro Val His lie lie Leu Ala Val lie Val Phe lie Gly Leu 

50 55 60 

Leu Tyr lie Val Trp Lys Val Ala Gin Trp Trp Arg His Arg Lys Asp 

65 70 75 80 

His Arg Arg Leu Glu Gin Gin Lys Ala Ala Phe Ala Arg Gin Ala lie 

85 90 95 



Thr Leu Val 
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(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 4518.. 4937 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACAATGT ACGCGAAAGC GACAGACGTG 60 
20 GCGCGTGTCT ACGCCGCGGC AGATGTCGCC TACGCGAACG TACTGCAGCA GAGAGCAGTC 12 0 

AAGTTGGACT TCGCCCCGCC ACTGAAGGCA CTAGAAACCC TCCACAGACT GTACTATCCG 180 



25 



30 



35 



CTGCGCTTCA AAGGGGGCAC TTTACCCCCG ACACAACACC CGATCCTGGC CGGGCACCAA 24 0 



CGTGTCGCAG AAGAGGTTCT GCACAATTTC GCCAGGGGAC GTAGCACAGT GCTCGAGATA 300 



GGGCCGTCTC T GC AC AGCGC ACTTAAGCTA CATGGGGCAC CGAACGCCCC CGTCGCAGAC 360 



TATCACGGGT GCACCAAGTA CGGCACCCGC GACGGCTCGC GAC AC AT T AC GGCCTTAGAG 420 



TCTAGATCCG TCGCCACAGG CCGGCCCGAG TTCAAGGCCG ACGCCTCACT GCTCGCCAAC 48 0 



GGCATTGCCT CCCGCACCTT CTGCGTCGAC GGAGTCGGCT CTTGCGCGTT CAAATCGCGC 54 0 



GTTGGAATTG CCAATCACTC CCTCTATGAC GT GACCCTAG AGGAGCTGGC CAATGCGTTT 600 



183 

GAGAACCACG GAC TTC AC AT GGTCCGCGCG TTCATGCACA TGCCAGAAGA GCTGCTCTAC 660 

ATGGACAACG TGGTTAATGC CGAGCTCGGC TACCGCTTCC ACGTTATTGA AGAGCC TAT G 72 0 

5 GCTGTGAAGG ACTGCGCATT CCAGGGGGGG GACCTCCGTC TCCACTTCCC TGAGTTGGAC 76 0 

TTCATCAACG AGAGCCAAGA GCGGCGCATC GAGAGGCTGG CCGCCCGCGG CTCCTACTCC 840 

AGACGCGCCG TCATTTTCTC CGGCGACGAC GACTGGGGTG ATGCGTACTT ACACGACTTC 900 

10 



184 



CACACATGGC TCGCCTACCT ACTGGTGAGG AACTACCCCA CTCCGTTTGG TTTCTCACTC 960 



CATATAGAAG TCCAGAGGCG CCACGGCTCC AGCATTGAGC TGCGCATCAC TCGCGCGCCA 1020 



5 CCTGGAGACC GCATGCTGGC CGTCGTCCCA AGGACGTCCC AAGGCCTCTG CAGAATCCCA 1080 



AACATCTTTT ATTACGCCGA CGCGTCGGGC ACT GAGC AT A AGACCATCCT TACGTCACAG 114 0 



10 



15 



20 



25 



30 



35 



CACAAAGTCA ACATGCTGCT CAATTTTATG CAAACGCGTC CTGAGAAGGA AC TAGTC GAC 12 00 



ATGACCGTCT TGATGTCGTT CGCGCGCGCT AGGCTGCGCG CGATCGTGGT CGCCTCAGAA 1260 



GTCACCGAGA GCTCCTGGAA CATCTCACCG GCTGACCTGG TCCGCACTGT CGTGTCTCTT 1320 



TACGTCCTCC ACATCATCGA GCGCCGAAGG GCTGCGGTCG CTGTCAAGAC CGCCAAGGAC 1380 



GACGTCTTTG GAGAGACTTC GTTCTGGGAG AGTCTCAAGC ACGTCTTGGG CTCCTGTTGC 1440 



GGTCTGCGCA ACCTCAAAGG CACCGACGTC GTCTTTACTA AGCGCGTCGT CGATAAGTAC 1500 



CGAGTCCACT CGCTCGGAGA CATAATCTGC GACGTCCGCC TGTCCCCTGA ACAGGTCGGC 1560 



TTCCTGCCGT CCCGCGTACC ACCTGCCCGC GTCTTTCACG ACAGGGAAGA GCTTGAGGTC 162 0 



CTTCGCGAAG CTGGCTGCTA CAAC GAACGT CCGGTACCTT CCACTCCTCC TGTGGAGGAG 1680 



CCCCAAGGTT TCGACGCCGA CTTGTGGCAC GCGACCGCGG CCTCACTCCC CGAGTACCGC 1740 



GCCACCTTGC AGGCAGGTCT CAACACCGAC GTCAAGCAGC TCAAGATCAC CCTCGAGAAC 1800 



GCCCTCAAGA CCATCGACGG GCTCACCCTC TCCCCAGTCA GAGGCCTCGA GATGTACGAG 1860 



GGCCCGCCAG GCAGCGGCAA GACGGGCACC CTCATCGCCG CCCTTGAGGC CGCGGGCGGT 1920 



AAAGCACTTT AC GTGGCACC CACCAGAGAA CTGAGAGAGG CTATGGACCG GCGGATCAAA 198 0 



185 



CCGCCGTCCG CCTCGGCTAC GCAACATGTC GCCCTTGCGA TTCTCCGTCG TGCCACCGCC 



GAGGGCGCCC CTTTCGCTAC CGTGGTTATC GACGAGTGCT TCATGTTCCC GCTCGTGTAC 



GTCGCGATCG TGCACGCCTT GTCCCCGAGC TCACGAATAG TCCTTGTAGG GGACGTCCAC 



CAAATCGGGT TTATAGACTT CCAAGGCACA AGCGC GAAC A TGCCGCTCGT TCGCGACGTC 



GTTAAGCAGT GCCGTCGGCG CACTTTCAAC CAAACCAAGC GCTGTCCGGC CGACGTCGTT 



GCCACCACGT TTTTCCAGAG CTTGTACCCC GGGTGCACAA CCACCTCAGG GTGCGTCGCA 



TCCATCAGCC ACGTCGCCCC AGACTACCGC AACAGCCAGG CGCAAACGCT CTGCTTCACG 



CAGGAGGAAA AGTCGC GCC A CGGGGCTGAG GGCGCGATGA CTGTGCACGA AGCGC AGGGA 



CGCACTTTTG CGTCTGTCAT TCTGCATTAC AACGGCTCCA CAGCAGAGCA GAAGCTCCTC 



GCTGAGAAGT CGCACCTTCT AGTCGGCATC ACGCGCCACA CCAACCACCT GTACATCCGC 



GACCCGACAG GTGACATTGA GAGACAACTC AACCATAGCG CGAAAGCCGA GGTGTTTACA 



GACATCCCTG CACCCCTGGA GATCACGACT GTCAAACCGA GTGAAGAGGT GCAGCGCAAC 
GAAGTGATGG CAACGATACC CCCGCAGAGT GCCACGCCGC ACGGAGCAAT CCATCTGCTC 



CGCAAGAACT TCGGGGACCA ACCCGACTGT GGCTGTGTCG CTTTGGCGAA GACCGGCTAC 



GAGGTGTTTG GC GGTCGTGC CAAAATCAAC GTAGAGCTTG CCGAACCCGA CGCGACCCCG 



AAGCCGCATA GGGC GTTCC A GGAAGGGGTA CAGTGGGTCA AGGTCACCAA CGCGTCTAAC 



AAACACCAGG CGCTCCAGAC GCTGTTGTCC CGCTACACCA AGCGAAGCGC TGACCTGCCG 



CTACACGAAG CTAAGGAGGA CGTCAAACGC ATGCTAAACT CGCTTGACCG ACATTGGGAC 



186 

TGGACTGTCA CTGAAGACGC CCGTGACCGA GCTGTCTTCG AGACCCAGCT CAAGTTCACC 3120 

CAACGCGGCG GCACCGTCGA AGACCTGCTG GAGCCAGACG ACCCCTACAT CCGTGACATA 3160 

5 GACTTCCTTA T GAAGACTC A GCAGAAAGTG TCGCCCAAGC CGATCAATAC GGGCAAGGTC 3240 

GGGCAGGGGA TCGCCGCTCA CTCAAAGTCT CTCAACTTCG TCCTCGCCGC TTGGATACGC 3300 

ATACTCGAGG AGATACTCCG TACCGGGAGC CGCACGGTCC GGTACAGCAA CGGTCTCCCC 3360 

10 

GAC GAAGAAG AGGCCATGCT GCTCGAAGCG AAGATCAATC AAGTCCCACA CGCCACGTTC 3420 



10 



20 



30 



187 



GTCTCGGCGG AC TGGACCGA GTTTGACACC GCCCACAATA ACACGAGTGA GCTGCTCTTC 34 80 



GCCGCCCTTT TAGAGCGCAT CGGCACGCCT GCAGCTGCCG TTAATCTATT CAGAGAACGG 354 0 



5 TGTGGGAAAC GCACCTTGCG AGCGAAGGGT CTAGGCTCCG TTGAAGTCGA CGGTCTGCTC 3600 



GACTCC GGCG CAGCTTGGAC GCCTTGCCGC AACACCATCT TCTCTGCCGC CGTCATGCTC 3660 



ACGCTCTTCC GCGGCGTCAA GTTCGCAGCT TTCAAAGGCG ACGACTCGCT CCTCTGTGGT 37 20 



AGCCATTACC TCCGTTTCGA CGCTAGCCGC CTTCACATGG GC GAACGTT A CAAGACCAAA 3780 



CATTTGAAGG TCGAGGTGCA GAAAATCGTG CCGTACATCG GACTCCTCGT CTCCGCTGAG 38 40 

/ 

15 CAGGTCGTCC TCGACCCTGT CAGGAGCGCT CTCAAGATAT TTGGGCGCTG CTACACAAGC 3900 



GAACTCCTTT ACTCCAAGTA CGTGGAGGCT GTGAGAGACA TCACCAAGGG CTGGAGTGAC 3960 



GCCCGCTACC ACAGCCTCCT GTGCCACATG TCAGCATGCT ACTACAATTA CGCGCCGGAG 4020 



TCTGCGGCGT ACATCATCGA CGCTGTTGTT CGCTTTGGGC GCGGCGACTT CCCGTTTGAA 4080 



CAACTGCGCG TGGTGCGTGC CCATGTGCAG GCACCCGACG CTTACAGCAG CACGTATCCG 4140 



25 GCTAACGTGC GCGCATCGTG CCTTGACCAC GTCTTCGAGC CCCGCCAGGC CGCCGCCCCG 42 00 



GCAGGTTTCG TTGCGACATG TGCGAAGCCG GAAACGCCTT CTTCACTTAC CGCGAAAGCT 4260 



GGTGTTTCTG CGACTACAAG CCACGTTGCG ACTGGGACTG CGCCCCCGGA GTCTCCATGG 4 320 



GATGCACCTG CAGCCAACAG CTTTTCGGAG TTATTGACAC CGGAGACCCC GTCCACATCA 4 380 



TCCTCGCCGT CATCGTCTTC ATCGGACTCC TCTACATCGT GTGGAAGGTC GCTCAGTGGT 4 440 



35 GGAGACACCG CAAGGACCAC AGAAGACTTG AACAGCAGAA AGCCGCCTTC GCAAGACAGG 4 500 



188 



CAATCACGCT CGTCTGA ATG TCT GGA CAG AAG CGG AGA AAG GAC AGG CAG 4 550 

Met Ser Gly Gin Lys Arg Arg Lys Asp Arg Gin 
15 10 

5 TTC GTT AAC TGC CCC CAC TGC TCC GAG CCC CTC ATT CTC ATT TTC GGA 4 598 

Phe Val Asn Cys Pro His Cys Ser Glu Pro Leu lie Leu lie Phe Gly 
15 20 25 

AAG AGC TCG ACT GGC GAC CGG GCC GAC TGT CGC CGC TGC GAC ATC ACC 4 64 6 

10 Lys Ser Ser Thr Gly Asp Arg Ala Asp Cys Arg Arg Cys Asp lie Thr 

30 35 40 



TTC GGC AAC CCC ATC CTG CGC CAC GGA CCA GGT TGC CGC GAG GAC CAC 4 694 

Phe Gly Asn Pro lie Leu Arg His Gly Pro Gly Cys Arg Glu Asp His 
45 50 55 



15 



20 



GCC GGA CTT TGC GCC TTT CCT GGG TTC CCA GTC TGC CCG TGC TGT CTC 4 7 42 

Ala Gly Leu Cys Ala Phe Pro Gly Phe Pro Val Cys Pro Cys Cys Leu 
60 65 70 75 

GAA GCC GTA CCG GCC CCC CAC GAC TGC CCG TTG GAA AGA AGT CAC CCC 4 7 90 

Glu Ala Val Pro Ala Pro His Asp Cys Pro Leu Glu Arg Ser His Pro 
80 85 90 



25 GCT CCA CGC GTG GAA GGG CGT GAC CGG AGA CCG ACC GGA AGT CAG GGA 4 838 

Ala Pro Arg Val Glu Gly Arg Asp Arg Arg Pro Thr Gly Ser Gin Gly 
95 100 105 

GGA CCC GGA GAC AGC GGC GGT CGT CCA GGC TCT GAT CAG CGG CCG TTA 4886 
30 Gly Pro Gly Asp Ser Gly Gly Arg Pro Gly Ser Asp Gin Arg Pro Leu 

110 115 120 

TCC TCA GAA GAC GAA GCT TTC CTC CGA CGC ATC CAA AGG CTA CTC AAG 4934 
Ser Ser Glu Asp Glu Ala Phe Leu Arg Arg lie Gin Arg Leu Leu Lys 
35 125 130 135 



i 



189 

AAC TAAGGGATGC TCACAATCCA CCTCTTTTCC TGCCCCGAGT GCGGATTACC ■ 4 987 

Asn 

140 

5 AGGCCCGCGA CTGCCAGACA GTCCGAGTCT GCCGCGCCGC TGCAGAGATG GCGCGCTCAT 504 7 

GTATTCACGA GCCGTTGGCT TCATCTGCCG CCAGTGCCGA CTTGAAGCGC ATACGCTCTA 5107 
CCTCGGACTC TGTTCCCGAT GTAAAGATCA GCAAGAGCGC AT GAAGGAAC AAAATTAGTT 5167 

10 

TCCTTGTTCG TAAACAAGGT GGTCCCTCCC ATTGAGGTAA AGACTCTGGT GAGTCCTCAA 5227 



190 



CGTTACTCGT TGAGTCTGCT GCGGTTCGAT TCCATTCCCA AGCAGCAAAG GGTGCGCAAC 5287 
TAGTACGGCG CCCCCTGGGA TACCA 5312 

5 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A} LENGTH: 140 amino acids 

{B) TYPE: amino acid * 
(D) TOPOLOGY: linear 

{ ii) MOLECULE TYPE: protein 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



Met Ser Gly Gin Lys Arg Arg Lys Asp Arg Gin Phe Val Asn Cys Pro 
20 1 5 10 15 

His Cys Ser Glu Pro Leu lie Leu lie Phe Gly Lys Ser Ser Thr Gly 
20 25 30 

25 Asp Arg Ala Asp Cys Arg Arg Cys Asp lie Thr Phe Gly Asn Pro lie 

35 40 45 

Leu Arg His Gly Pro Gly Cys Arg Glu Asp His Ala Gly Leu Cys Ala 
50 55 60 

30 

Phe Pro Gly Phe Pro Val Cys Pro Cys Cys Leu Glu Ala Val Pro Ala 
65 70 75 80 

Pro His Asp Cys Pro Leu Glu Arg Ser His Pro Ala Pro Arg Val Glu 
35 85 90 95 



191 



Gly Arg Asp Arg Arg Pro Thr Gly Ser Gin Gly Gly Pro Gly Asp Ser 
100 105 110 

Gly Gly Arg Pro Gly Ser Asp Gin Arg Pro Leu Ser Ser Glu Asp Glu 
115 120 125 

Ala Phe Leu Arg Arg lie Gin Arg Leu Leu Lys Asn 
130 135 140 



(2) INFORMATION FOR SEQ ID NO : 4 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 4 944.. 5162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACAATGT ACGCGAAAGC GACAGACGTG 
GCGCGTGTCT ACGCCGCGGC AGATGTC GCC TACGCGAACG TACTGCAGCA GAGAGCAGTC 
AAGTTGGACT TCGCCCCGCC AC TGAAGGC A CTAGAAACCC TCC AC AG AC T GTACTATCCG 
CTGCGCTTCA AAGGGGGCAC TTTACCCCCG ACACAACACC CGATCCTGGC CGGGCACCAA 



192 

CGT GTCGC AG AAGAGGTTCT GC AC AATTTC GCCAGGGGAC GTAGCACAGT GCTCGAGATA 300 

GGGCCGTCTC TGCACAGCGC AC T T AAGCTA CATGGGGCAC CGAACGCCCC CGTCGCAGAC 360 

5 TATCACGGGT GCACCAAGTA CGGCACCCGC GACGGCTCGC GACACATTAC GGCCTTAGAG 420 

TCTAGATCCG TCGCCACAGG CCGGCCCGAG TTCAAGGCCG ACGCCTCACT GCTCGCCAAC 480 

GGCATTGCCT CCCGCACCTT CTGCGTCGAC GGAGTCGGCT CTTGCGCGTT CAAATCGCGC 540 

10 

GTTGGAATTG CCAATCACTC CCTCTATGAC GT GACCC TAG AGGAGCTGGC CAATGCGTTT 600 



193 



GAGAACC AC G GACTTCACAT GGTCCGCGCG TTCATGCACA TGCCAGAAGA GCTGCTCTAC 660 



ATGGACAACG TGGTTAATGC CGAGCTCGGC TACCGCTTCC ACGTTATTGA AGAGCCTATG 720 



5 GCTGTGAAGG ACTGCGCATT CCAGGGGGGG GACCTCCGTC TCCACTTCCC TGAGTTGGAC 780 



TTCATCAACG AGAGCCAAGA GCGGCGCATC GAGAGGC TGG CCGCCCGCGG CTCCTACTCC 84 0 



10 



20 



30 



AGACGCGCCG TCATTTTCTC CGGCGACGAC GACTGGGGTG ATGCGTACTT ACACGACTTC 900 



CACACATGGC TCGCCTACCT ACTGGTGAGG AACTACCCCA CTCCGTTTGG TTTCTCACTC 960 



CATATAGAAG TCCAGAGGCG CCACGGCTCC AGCATTGAGC TGCGCATCAC TCGCGCGCCA 1020 

/ 

15 CCTGGAGACC GCATGCTGGC CGTCGTCCCA AGGACGTCCC AAGGCCTCTG CAGAATCCCA 1080 



AACATCTTTT ATTACGCCGA CGCGTCGGGC ACTGAGCATA AGACCATCCT TACGTCACAG 1140 



CACAAAGTCA ACATGCTGCT CAATTTTATG CAAACGCGTC CTGAGAAGGA AC T AGTCGAC 12 00 



AT GACCGTC T TGATGTCGTT CGCGCGCGCT AGGCTGCGCG CGATCGTGGT CGCCTCAGAA 12 60 



GTCACCGAGA GCTCCTGGAA CATCTCACCG GCTGACCTGG TCCGCACTGT CGTGTCTCTT 1320 



25 TACGTCCTCC ACATCATCGA GCGCCGAAGG GCTGCGGTCG CTGTCAAGAC CGCCAAGGAC 1380 



GACGTCTTTG GAGAGACTTC GTTCTGGGAG AGTCTCAAGC ACGTCTTGGG CTCCTGTTGC 14*40 



GGTCTGCGCA ACCTCAAAGG CACCGACGTC GTCTTTACTA AGCGCGTCGT C GAT AAGT AC 1500 



CGAGTCCACT CGC TCGGAGA CATAATCTGC GACGTCCGCC TGTCCCCTGA ACAGGTCGGC 1560 



TTCCTGCCGT CCCGCGTACC ACCTGCCCGC GTCTTTCACG ACAGGGAAGA GCTTGAGGTC 162 0 



35 CTTCGCGAAG CTGGCTGCTA CAACGAACGT CCGGTACCTT CCACTCCTCC TGTGGAGGAG 168 0 



194 



CCCCAAGGTT TCGACGCCGA CTTGTGGCAC GCGACCGCGG CCTCACTCCC CGAGTACCGC 174 0 



GCCACCTTGC AGGCAGGTCT CAACACCGAC GTCAAGCAGC TCAAGATCAC CCTCGAGAAC 1800 



5 GCCCTCAAGA CCATCGACGG GCTCACCCTC TCCCCAGTCA GAGGCCTCGA GATGTACGAG 1860 



GGCCCGCCAG GCAGCGGCAA GACGGGCACC CTCATCGCCG CCCTTGAGGC CGCGGGCGGT 192 0 



10 



AAAGCACTTT ACGTGGCACC CACCAGAGAA CTGAGAGAGG CTATGGACCG GCGGATCAAA 1980 



CCGCCGTCCG CCTCGGCTAC GCAACATGTC GCCCTTGCGA TTCTCCGTCG TGCCACCGCC 2 040 



GAGGGCGCCC CTTTCGCTAC CGTGGTTATC GACGAGTGCT TCATGTTCCC GCTCGTGTAC 2100 



15 GTCGCGATCG TGCACGCCTT GTCCCCGAGC TC AC GAAT AG TCCTTGTAGG GGACGTCCAC 2160 



CAAATCGGGT TTATAGACTT CCAAGGCACA AGCGCGAACA TGCCGCTCGT TCGCGACGTC 2220 



20 



GTTAAGCAGT GCCGTCGGCG CACTTTCAAC CAAACCAAGC GCTGTCCGGC CGACGTCGTT 2280 



GCCACCACGT TTTTCCAGAG CTTGTACCCC GGGTGCACAA CCACCTCAGG GTGCGTCGCA 2 340 



TCCATCAGCC ACGTCGCCCC AGACTACCGC AACAGCCAGG CGCAAACGCT CTGCTTCACG 2 4 00 



25 CAGGAGGAAA AGTCGCGCCA C GGGGCT GAG GGC GCGATGA CTGTGCACGA AGCGCAGGGA 24 60 



CGCACTTTTG CGTCTGTCAT TCTGCATTAC AAC GGCTCC A CAGCAGAGCA GAAGCTCCTC 2 520 



30 



GCTGAGAAGT CGCACCTTCT AGTCGGCATC ACGCGCCACA CCAACCACCT GTACATCCGC 2 580 



GACCCGACAG GTGACATTGA GAGACAACTC AACCATAGCG CGAAAGCCGA GGTGTTTACA 2640 



GACATCCCTG CACCCCTGGA GATCACGACT GTCAAACCGA GTGAAGAGGT GCAGCGCAAC 2700 



35 GAAGTGATGG C AAC GAT ACC CCCGCAGAGT GCCACGCCGC ACGGAGCAAT CCATCTGCTC 27 60 



195 

CGCAAGAACT TCGGGGACCA ACCCGACTGT GGCTGTGTCG CTTTGGCGAA GACCGGCTAC 2820 

GAGGTGTTTG GCGGTCGTGC CAAAATCAAC GTAGAGCTTG CCGAACCCGA CGCGACCCCG 2 880 

5 AAGCC GC AT A GGGCGTTCCA GGAAGGGGTA CAGTGGGTCA AGGTCACCAA CGCGTCTAAC 2940 

AAACACCAGG CGCTCCAGAC GCTGTTGTCC CGCTACACCA AGCGAAGCGC TGACCTGCCG 3000 

CTACACGAAG CTAAGGAGGA CGTCAAACGC ATGCTAAACT CGCTTGACCG ACATTGGGAC 3060 

10 

TGGACTGTCA CTGAAGACGC CCGTGACCGA GCTGTCTTCG AGACCCAGCT CAAGTTCACC 3120 



196 



CAACGCGGCG GCACCGTCGA AGACCTGCTG GAGCCAGACG ACCCCTACAT CCGTGACATA 



GACTTCCTTA TGAAGACTCA GCAGAAAGTG TCGCCCAAGC CGATCAATAC GGGCAAGGTC 



GGGCAGGGGA TCGCCGCTCA CTCAAAGTCT CTCAACTTCG TCCTCGCCGC TTGGATACGC 



ATACTCGAGG AGAT AC TCCG TACCGGGAGC CGCACGGTCC GGTACAGCAA CGGTCTCCCC 



GAC GAAGAAG AGGCCATGCT GCTCGAAGCG AAGATCAATC AAGTCCCACA CGCCACGTTC 



GTCTCGGCGG ACTGGACCGA GTTTGACACC GCCCACAATA AC ACGAGT GA GCTGCTCTTC 



GCCGCCCTTT TAGAGCGCAT CGGCACGCCT GCAGCTGCCG TTAATCTATT CAGAGAACGG 



TGTGGGAAAC GCACCTTGCG AGCGAAGGGT CTAGGCTCCG TTGAAGTCGA CGGTCTGCTC 



GACTCCGGCG CAGCTTGGAC GCCTTGCCGC AACACCATCT TCTCTGCCGC CGTCATGCTC 



ACGCTCTTCC GCGGCGTCAA GTTCGCAGCT TTCAAAGGCG ACGACTCGCT CCTCTGTGGT 



AGCCATTACC TCCGTTTCGA CGCTAGCCGC CTTCACATGG GCGAACGTTA CAAGACCAAA 



CATTTGAAGG TCGAGGTGCA GAAAATCGTG CCGTACATCG GACTCCTCGT CTCCGCTGAG 



CAGGTCGTCC TCGACCCTGT CAGGAGCGCT CTCAAGATAT TTGGGCGCTG CTACACAAGC 



GAACTCCTTT ACTCCAAGTA CGTGGAGGCT GTGAGAGACA TCACCAAGGG CTGGAGTGAC 



GCCCGCTACC ACAGCCTCCT GTGCCACATG TCAGCATGCT ACTACAATTA CGCGCCGGAG 



TCTGCGGCGT ACATCATCGA CGCTGTTGTT CGCTTTGGGC GCGGCGACTT CCCGTTTGAA 



CAACTGCGCG TGGTGCGTGC CCATGTGCAG GCACCCGACG CTTACAGCAG CACGTATCCG 



GCTAACGTGC GCGCATCGTG CCTTGACCAC GTCTTCGAGC CCCGCCAGGC CGCCGCCCCG 



197 



10 



15 



GC AGGTTTCG TTGCGACATG TGCGAAGCCG GAAACGCCTT CTTCACTTAC CGCGAAAGCT 4260 

GGTGTTTCTG CGACTACAAG CCACGTTGCG ACTGGGACTG CGCCCCCGGA GTCTCCATGG 4 320 

GATGCACCTG CAGCCAACAG CTTTTCGGAG T T AT TGACAC CGGAGACCCC GTCCACATCA 4 380 

TCCTCGCCGT CATCGTCTTC ATCGGACTCC TCTACATCGT GTGGAAGGTC GCTCAGTGGT 4 440 

GGAGACACCG CAAGGACCAC AGAAGACTTG AACAGCAGAA AGCCGCCTTC GCAAGACAGG 4 500 

CAATCACGCT CGTCTGAATG TCTGGACAGA AGC GGAGAAA GGACAGGCAG TTCGTTAACT 4 560 

GCCCCCACTG CTCCGAGCCC CTCATTCTCA TTTTCGGAAA GAGCTCGACT GGCGACCGGG 4 620 

/ 

CCGACTGTCG CCGCTGCGAC ATCACCTTCG GCAACCCCAT CCTGCGCCAC GGACCAGGTT 4 68 0 

GCCGCGAGGA CCACGCCGGA CTTTGCGCCT TTCCTGGGTT CCCAGTCTGC CCGTGCTGTC 47 40 

TCGAAGCC GT ACCGGCCCCC CACGACTGCC CGTTGGAAAG AAGTCACCCC GCTCCACGCG 4800 

TGGAAGGGCG TGACCGGAGA CCGACCGGAA GTCAGGGAGG ACCC GGAGAC AGCGGCGGTC 4 860 

GTCCAGGCTC TGATCAGCGG CCGTTATCCT CAGAAGACGA AGCTTTCCTC CGACGCATCC 4 920 



25 AAAGGCTACT CAAGAACTAA GGG ATG CTC ACA ATC CAC CTC TTT TCC TGC 4 970 

Met Leu Thr lie His Leu Phe Ser Cys 
1 5 



CCC GAG TGC GGA TTA CCA GGC CCG CGA CTG CCA GAC AGT CCG AGT CTG 5018 
Pro Glu Cys Gly Leu Pro Gly Pro Arg Leu Pro Asp Ser Pro Ser Leu 
10 15 20 25 

CCG CGC CGC TGC AGA GAT GGC GCG CTC ATG TAT TCA CGA GCC GTT GGC 5066 
Pro Arg Arg Cys Arg Asp Gly Ala Leu Met Tyr Ser Arg Ala Val Gly 
30 35 40 



20 



30 



35 



198 



TTC ATC TGC CGC CAG TGC CGA CTT GAA GCG CAT ACG CTC TAC CTC GGA 5114 
Phe lie Cys Arg Gin Cys Arg Leu Glu Ala His Thr Leu Tyr Leu Gly 
45 50 55 

5 CTC TGT TCC CGA TGT AAA GAT CAG CAA GAG CGC ATG AAG GAA CAA AAT 5162 

Leu Cys Ser Arg Cys Lys Asp Gin Gin Glu Arg Met Lys Glu Gin Asn 
60 65 70 



10 



15 



20 



25 



30 



TAGTTTCCTT GTTCGTAAAC AAGGTGGTCC CTCCCATTGA GGTAAAGACT CTGGTGAGTC 5222 



CTCAACGTTA CTCGTTGAGT CTGCTGCGGT TCGATTCCAT TCCCAAGCAG CAAAGGGTGC 5282 



GCAACTAGTA CGGCGCCCCC T GGGATACC A 5312 



(2) INFORMATION FOR SEQ ID NO : 4 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 
{ B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



Met Leu Thr lie His Leu Phe Ser Cys Pro Glu Cys Gly Leu Pro Gly 

15 10 15 

Pro Arg Leu Pro Asp Ser Pro Ser Leu Pro Arg Arg Cys Arg Asp Gly 

20 25 30 



35 Ala Leu Met Tyr Ser Arg Ala Val Gly Phe He Cys Arg Gin Cys Arg 

35 40 45 



199 



Leu Glu Ala His Thr Leu Tyr Leu Gly Leu Cys Ser Arg Cys Lys Asp 
50 55 60 



Gin Gin Glu Arg Met Lys Glu Gin Asn 
65 70 



10 



15 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 
20 (A) NAME /KEY: CDS 

(B) LOCATION: 283. .753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



25 



30 



35 



GTTTTTCTTT CTTTACCAAG TGTGGTAAAA TTTAAACAAA GAAGAAAACC AGGACCGTAA 60 



CCCGGCCCTT ACACACCTCG AGTCCGTGAC CACCGGATTA TACGTCGCCC ACCACACGGC 120 



GCCTTTTCCG ACCACTCTCG AGAGTCGTTG GGAGTTTCGT CCGTGACCAC CCGGTTGGCA 180 



GTCGACAGAC GCTTCCGGAC CACTAGAACC TCCTCGAGCG ACGCACACAC AGCACACACA 240 



CCGCCTTAGC TGCACCTACG GCAGCGTTGA TAGCGCGGAT TT ATG AGC GAG CAC 2 94 

Met Ser Glu His 



200 



ACC ATC GCC CAC TCC ATC ACA TTA CCA CCC GGT TAC ACC CTT GCC CTA 
Thr lie Ala His Ser lie Thr Leu Pro Pro Gly Tyr Thr Leu Ala Leu 
5 10 15 20 



ATA CCC CCT GAA CCT GAA GCA GGA TGG GAG ATG CTG GAG TGG CGT CAC 
lie Pro Pro Glu Pro Glu Ala Gly Trp Glu Met Leu Glu Trp Arg His 
25 30 35 



AGC GAC CTC ACA ACC GTC GCG GAA CCC GTA ACG TTC GGG TCA GCG CCA 
Ser Asp Leu Thr Thr Val Ala Glu Pro Val Thr Phe Gly Ser Ala Pro 
40 45 50 



ACA CCG TCA CCG TCA ATG GTA GAA GAA ACC AAC 
Thr Pro Ser Pro Ser Met Val Glu Glu Thr Asn 
55 60 



GGC 
Gly 



GTC 
Val 
65 



GGA CCG GAA 
Gly Pro Glu 
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GGC AAG TTT CTC CCC CTG ACA ATT TCA CCG CTG CTG CAC AAG ACC TCG 
Gly Lys Phe Leu Pro Leu Thr lie Ser Pro Leu Leu His Lys Thr Ser 
70 75 60 

CGC AAA GCC TTG ACG CCA ACA CCG TCA CTT TCC CCG CTA ACA TCT CTA 
Arg Lys Ala Leu Thr Pro Thr Pro Ser Leu Ser Pro Leu Thr Ser Leu 
85 90 95 100 

GCA TGC CCG AAT TCC GGA ATT GGG CCA AGG GAA AGA TCG ACC TCG ACT 
Ala Cys Pro Asn Ser Gly lie Gly Pro Arg Glu Arg Ser Thr Ser Thr 
105 110 115 

CCG ATT CCA TCG GCT GGT ACT TCA AGT ACC TTG ACC CAG CGG GTG CTA 
Pro lie Pro Ser Ala Gly Thr Ser Ser Thr Leu Thr Gin Arg Val Leu 
120 125 130 

CAG AGT CTG CGC GCG CCG TCG GCG AGT ACT CGA AGA TCC CTG ACG GCC 
Gin Ser Leu Arg Ala Pro Ser Ala Ser Thr Arg Arg Ser Leu Thr Ala 
135 140 145 

TCG TCA AGT TCT CCG TCG ACG CAG AGA TAAGAGAGAT CTATAACGAG 
Ser Ser Ser Ser Pro Ser Thr Gin Arg 
150 155 

GAGTGCCCCG TCGTCACTGA CGTGTCCGTC CCCCTCGACG GCCGCCAGTG GAGCCTCTCG 

ATTTTCTCCT TTCCGATGTT CAGAACCGCC TACGTCGCCG TAGCGAACGT CGAGAACAAG 

GAGATGTCGC TCGACGTTGT CAACGACCTC ATCGAGTGGC TCAACAATCT CGCCGACTGG 

CGTTATGTCG TTGACTCTGA ACAGTGGATT AACTTCACCA ATGACACCAC GTACTACGTC 

CGCATCCGCG TTCTACGTCC AACCTACGAC GTTCCAGACC CCACAGAGGG CCTTGTTCGC 

ACAGTCTCAG ACTACCGCCT CACTTATAAG GC GAT AAC AT GTGAAGCCAA C AT GCCAACA 
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CTCGTCGACC AAGGCTTTTG GATCGGCGGC CAGTACGCTC TCACCCCGAC TAGCCTACCG 1193 



CAGTACGACG TCAGCGAGGC CTACGCTCTG CACACTTTGA CCTTCGCCAG ACCATCCAGC 1253 



5 GCCGCTGCAC TCGCGTTTGT GTGGGCAGGT TTGCCACAGG GTGGCACTGC GCCTGCAGGC 1313 



ACTCCAGCCT GGGAGCAGGC ATCCTCGGGT GGCTACCTCA CCTGGCGCCA CAACGGTACT 137 3 



10 



20 



30 



ACTTTCCCAG CTGGCTCCGT TAGCTACGTT CTCCCTGAGG GTTTCGCCCT TGAGCGC T AC 14 33 



GACCC GAACG ACGGCTCTTG GACCGACTTC GCTTCCGCAG GAGACACCGT CACTTTCCGG 14 93 



CAGGTCGCCG TCGACGAGGT CGTTGTGACC AACAACCCCG CCGGCGGCGG CAGCGCCCCC 1553 
15 ACCTTCACCG TGAGAGTGCC CCCTTCAAAC GCTTACACCA ACACCGTGTT TAGGAACACG 1613 



CTCTTAGAGA CTCGACCCTC CTCTCGTAGG CTCGAACTCC CTATGCCACC TGCTGACTTT 167 3 



GGACAGACGG TCGCCAACAA CCCGAAGATC GAGCAGTCGC TTCTTAAAGA AACACTTGGC 17 33 



TGCTATTTGG TCCACTCCAA AATGC GAAAC CCCGTTTTCC AGCTCACGCC AGCCAGCTCC 17 93 



TTTGGCGCCG TTTCCTTCAA CAATCCGGGT TATGAGCGCA CACGCGACCT CCCGGACTAC 1853 



25 ACTGGCATCC GTGACTCATT CGACCAGAAC ATGTCCACCG CTGTGGCCCA CTTCCGCTCA 1913 



CTCTCCCACT CCTGCAGTAT CGTCACTAAG ACCTACCAGG GTT GGGAAGG CGTCACGAAC 197 3 



GTCAACACGC CTTTCGGCCA ATTCGCGCAC GCGGGCCTCC TCAAGAATGA GGAGATCCTC 203 3 



TGCCTCGCCG ACGACCTGGC CACCCGTCTC ACAGGTGTCT ACCCCGCCAC TGACAACTTC 2093 



GCGGCCGCCG TTTCTGCCTT CGCCGCGAAC ATGCTGTCCT CCGTGCTGAA GTCGGAGGCA 215 3 



35 ACGTCCTCCA TCATCAAGTC CGTTGGCGAG ACTGCCGTCG GCGCGGCTCA GTCCGGCCTC 2213 



203 

GCGAAGCTAC CCGGACTGCT AATGAGTGTA CCAGGGAAGA TTGCCGCGCG TGTCCGCGCG 22 7 3 

CGCCGAGCGC GCCGCCGCGC CGCTCGTGCC AATTAGTTTG CTCGCTCCTG TTTCGCCGTT 2333 

5 TCGTAAAACG GCGTGGTCCC GCACATTACG CGTACCCTAA AGACTCTGGT GAGTCCCCGT 2393 

CGTTACACGA CGGGTCTGCC GCGGTTCGAT TCCATTCCCA AGCGGCAAGA AGGACGTAGT 24 53 

TAGCTCTGCG TCCCTCGGGA TACCA 2478 

10 



204 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



Met Ser Glu His 
1 

Thr Leu Ala Leu 
20 

Glu Trp Arg His 
35 

Gly Ser Ala Pro 
50 

Val Gly Pro Glu 
65 

His Lys Thr Ser 

Leu Thr Ser Leu 
100 

Ser Thr Ser Thr 



Thr lie Ala His 
5 

lie Pro Pro Glu 

Ser Asp Leu Thr 
40 

Thr Pro Ser Pro 
55 

Gly Lys Phe Leu 
70 

Arg Lys Ala Leu 
85 

Ala Cys Pro Asn 
Pro lie Pro Ser 



Ser lie Thr Leu 
10 

Pro Glu Ala Gly 
25 

Thr Val Ala Glu 

Ser Met Val Glu 
60 

Pro Leu Thr He 
75 

Thr Pro Thr Pro 
90 

Ser Gly He Gly 
105 

Ala Gly Thr Ser 



Pro Pro Gly Tyr 
15 

Trp Glu Met Leu 
30 

Pro Val Thr Phe 
45 

Glu Thr Asn Gly 

Ser Pro Leu Leu 
80 

Ser Leu Ser Pro 
95 

Pro Arg Glu Arg 
110 

Ser Thr Leu Thr 
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115 120 125 

Gin Arg Val Leu Gin Ser Leu Arg Ala Pro Ser Ala Ser Thr Arg Arg 
130 135 140 

5 

Ser Leu Thr Ala Ser Ser Ser Ser Pro Ser Thr Gin Arg 
145 150 155 



15 



10 

(2) INFORMATION FOR SEQ ID NO: 4 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 366.. 2306 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
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GTTTTTCTTT CTTTACCAAG TGTGGTAAAA T TTAAAC AAA GAAGAAAACC AGGACCGTAA 60 



30 CCCGGCCCTT ACACACCTCG AGTCCGT GAC CACCGGAT T A TACGTCGCCC ACCACACGGC 120 



GCCTTTTCCG ACCACTCTCG AGAGTCGTTG GGAGT TTCGT CCGTGACCAC CCGGTTGGCA 180 



35 



GTCGACAGAC GCTTCCGGAC CACTAGAACC TCCTCGAGCG ACGCACACAC AGCACACACA 
CCGCCTTAGC TGCACCTACG GCAGCGTTGA TAGCGCGGAT TTATGAGCGA GCACACCATC 



240 



300 



206 



GCCCACTCCA TCACATTACC ACCCGGTTAC ACCCTTGCCC TAAT ACCCCC TGAACCTGAA 360 

GCAGG ATG GGA GAT GCT GGA GTG GCG TCA CAG CGA CCT CAC AAC CGT 407 
Met Gly Asp Ala Gly Val Ala Ser Gin Arg Pro His Asn Arg 
5 1 5 10 

CGC GGA ACC CGT AAC GTT CGG GTC AGC GCC AAC ACC GTC ACC GTC AAT 4 55 

Arg Gly Thr Arg Asn Val Arg Val Ser Ala Asn Thr Val Thr Val Asn 
15 20 . 25 30 



10 
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GGT AGA AGA AAC CAA CGG CGT CGG ACC GGA AGG CAA GTT TCT CCC CCT 
Gly Arg Arg Asn Gin Arg Arg Arg Thr Gly Arg Gin Val Ser Pro Pro 
35 40 45 

GAC AAT TTC ACC GCT GCT GCA CAA GAC CTC GCG CAA AGC CTT GAC GCC 
Asp Asn Phe Thr Ala Ala Ala Gin Asp Leu Ala Gin Ser Leu Asp Ala 
50 55 60 



AAC ACC GTC ACT TTC CCC GCT AAC ATC TCT AGC ATG CCC GAA TTC CGG 59 9 

20 Asn Thr Val Thr Phe Pro Ala Asn lie Ser Ser Met Pro Glu Phe Arg 

65 70 75 
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AAT TGG GCC AAG GGA AAG ATC GAC CTC GAC TCC GAT TCC ATC GGC TGG 
Asn Trp Ala Lys Gly Lys lie Asp Leu Asp Ser Asp Ser lie Gly Trp 
80 85 90 

TAC TTC AAG TAC CTT GAC CCA GCG GGT GCT ACA GAG TCT GCG CGC GCC 
Tyr Phe Lys Tyr Leu Asp Pro Ala Gly Ala Thr Glu Ser Ala Arg Ala 
95 100 105 110 

GTC GGC GAG TAC TCG AAG ATC CCT GAC GGC CTC GTC AAG TTC TCC GTC 
Val Gly Glu Tyr Ser Lys lie Pro Asp Gly Leu Val Lys Phe Ser Val 
115 120 125 

GAC GCA GAG ATA AGA GAG ATC TAT AAC GAG GAG TGC CCC GTC GTC ACT 
Asp Ala Glu lie Arg Glu lie Tyr Asn Glu Glu Cys Pro Val Val Thr 
130 135 140 

GAC GTG TCC GTC CCC CTC GAC GGC CGC CAG TGG AGC CTC TCG ATT TTC 
Asp Val Ser Val Pro Leu Asp Gly Arg Gin Trp Ser Leu Ser lie Phe 
145 150 155 

TCC TTT CCG ATG TTC AGA ACC GCC TAC GTC GCC GTA GCG AAC GTC GAG 
Ser Phe Pro Met Phe Arg Thr Ala Tyr Val Ala Val Ala Asn Val Glu 
160 165 170 

AAC AAG GAG ATG TCG CTC GAC GTT GTC AAC GAC CTC ATC GAG TGG CTC 
Asn Lys Glu Met Ser Leu Asp Val Val Asn Asp Leu lie Glu Trp Leu 
175 180 185 190 

AAC AAT CTC GCC GAC TGG CGT TAT GTC GTT GAC TCT GAA CAG TGG ATT 
Asn Asn Leu Ala Asp Trp Arg Tyr Val Val Asp Ser Glu Gin Trp lie 
195 200 205 

AAC TTC ACC AAT GAC ACC ACG TAC TAC GTC CGC ATC CGC GTT CTA CGT 
Asn Phe Thr Asn Asp Thr Thr Tyr Tyr Val Arg lie Arg Val Leu Arg 
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CCA ACC TAC GAC GTT CCA GAC CCC ACA GAG GGC CTT GTT CGC ACA GTC 107 9 

Pro Thr Tyr Asp Val Pro Asp Pro Thr Glu Gly Leu Val Arg Thr Val 
225 230 235 

5 TCA GAC TAC CGC CTC ACT TAT AAG GCG ATA ACA TGT GAA GCC AAC ATG 1127 

Ser Asp Tyr Arg Leu Thr Tyr Lys Ala lie Thr Cys Glu Ala Asn Met 
240 245 250 

CCA ACA CTC GTC GAC CAA GGC TTT TGG ATC GGC GGC CAG TAC GCT CTC 117 5 

10 Pro Thr Leu Val Asp Gin Gly Phe Trp lie Gly Gly Gin Tyr Ala Leu 

255 260 265 270 

ACC CCG ACT AGC CTA CCG CAG TAC GAC GTC AGC GAG GCC TAC GCT CTG 1223 

Thr Pro Thr Ser Leu Pro Gin Tyr Asp Val Ser Glu Ala Tyr Ala Leu 
15 275 280 285 

CAC ACT TTG ACC TTC GCC AGA CCA TCC AGC GCC GCT GCA CTC GCG TTT 1271 

His Thr Leu Thr Phe Ala Arg Pro Ser Ser Ala Ala Ala Leu Ala Phe 
290 295 300 
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GTG TGG GCA GGT TTG CCA CAG GGT GGC ACT GCG CCT GCA GGC ACT CCA 1319 

Val Trp . Ala Gly Leu Pro Gin Gly Gly Thr Ala Pro Ala Gly Thr Pro 
305 310 315 

GCC TGG GAG CAG GCA TCC TCG GGT GGC TAC CTC ACC TGG CGC CAC AAC 1367 

Ala Trp Glu Gin Ala Ser Ser Gly Gly Tyr Leu Thr Trp Arg His Asn 
320 325 330 



GGT ACT ACT TTC CCA GCT GGC TCC GTT AGC TAC GTT CTC CCT GAG GGT 
30 Gly Thr Thr Phe Pro Ala Gly Ser Val Ser Tyr Val Leu Pro Glu Gly 

335 340 345 350 



TTC GCC CTT GAG CGC TAC GAC CCG AAC GAC GGC TCT TGG ACC GAC TTC 
Phe Ala Leu Glu Arg Tyr Asp Pro Asn Asp Gly Ser Trp Thr Asp Phe 
355 360 365 



35 
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GCT TCC GCA GGA GAC ACC GTC ACT TTC CGG CAG GTC GCC GTC GAC GAG 1511 
Ala Ser Ala Gly Asp Thr Val Thr Phe Arg Gin Val Ala Val Asp Glu 
370 375 380 

5 GTC GTT GTG ACC AAC AAC CCC GCC GGC GGC GGC AGC GCC CCC ACC TTC 155 9 

Val Val Val Thr Asn Asn Pro Ala Gly Gly Gly Ser Ala Pro Thr Phe 
385 390 395 

ACC GTG AGA GTG CCC CCT TCA AAC GCT TAC ACC AAC ACC GTG TTT AGG 1607 
10 Thr Val Arg Val Pro Pro Ser Asn Ala Tyr Thr Asn Thr Val Phe Arg 

400 405 410 

AAC ACG CTC TTA GAG ACT CGA CCC TCC TCT CGT AGG CTC GAA CTC CCT 1655 
Asn Thr Leu Leu Glu Thr Arg Pro Ser Ser Arg Arg Leu Glu Leu Pro 
15 415 420 425 430 

ATG CCA CCT GCT GAC TTT GGA CAG ACG GTC GCC AAC AAC CCG AAG ATC 17 03 

Met Pro Pro Ala Asp Phe Gly Gin Thr Val Ala Asn Asn Pro Lys lie 
435 440 445 

20 

GAG CAG TCG CTT CTT AAA GAA ACA CTT GGC TGC TAT TTG GTC CAC TCC 17 51 

Glu .Gin Ser Leu Leu Lys Glu Thr Leu Gly Cys Tyr Leu Val His Ser 
450 455 460 

25 AAA ATG CGA AAC CCC GTT TTC CAG CTC ACG CCA GCC AGC TCC TTT GGC 17 99 

Lys Met Arg Asn Pro Val Phe Gin Leu Thr Pro Ala Ser Ser Phe Gly 
465 470 475 

GCC GTT TCC TTC AAC AAT CCG GGT TAT GAG CGC ACA CGC GAC CTC CCG 184 7 

30 Ala Val Ser Phe Asn Asn Pro Gly Tyr Glu Arg Thr Arg Asp Leu Pro 

480 485 490 

GAC TAC ACT GGC ATC CGT GAC TCA TTC GAC CAG AAC ATG TCC ACC GCT 18 9 5 

Asp Tyr Thr Gly lie Arg Asp Ser Phe Asp Gin Asn Met Ser Thr Ala 
35 495 500 505 510 
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GTG GCC CAC TTC CGC TCA CTC TCC CAC TCC TGC AGT ATC GTC ACT AAG 1943 
Val Ala His Phe Arg Ser Leu Ser His Ser Cys Ser lie Val Thr Lys 
515 520 525 

5 ACC TAC CAG GGT TGG GAA GGC GTC ACG AAC GTC AAC ACG CCT TTC GGC 1991 

Thr Tyr Gin Gly Trp Glu Gly Val Thr Asn Val Asn Thr Pro Phe Gly 
530 535 540 

CAA TTC GCG CAC GCG GGC CTC CTC AAG AAT GAG GAG ATC CTC TGC CTC 2039 
10 Gin Phe Ala His Ala Gly Leu Leu Lys Asn Glu Glu lie Leu Cys Leu 

545 550 555 

GCC GAC GAC CTG GCC ACC CGT CTC ACA GGT GTC TAC CCC GCC ACT GAC 2087 
Ala Asp Asp Leu Ala Thr Arg Leu Thr Gly Val Tyr Pro Ala Thr Asp 
15 560 565 570 

AAC TTC GCG GCC GCC GTT TCT GCC TTC GCC GCG AAC ATG CTG TCC TCC 2135 

Asn Phe Ala Ala Ala Val Ser Ala Phe Ala Ala Asn Met Leu Ser Ser 

575 580 585 590 

20 

GTG CTG AAG TCG GAG GCA ACG TCC TCC ATC ATC AAG TCC GTT GGC GAG 218 3 

Val Leu Lys Ser Glu Ala Thr Ser Ser He He Lys Ser Val Gly Glu 

595 600 605 

25 ACT GCC GTC GGC GCG GCT CAG TCC GGC CTC GCG AAG CTA CCC GGA CTG 22 31 

Thr Ala Val Gly Ala Ala Gin Ser Gly Leu Ala Lys Leu Pro Gly Leu 
610 615 620 

CTA ATG AGT GTA CCA GGG AAG ATT GCC GCG CGT GTC CGC GCG CGC CGA 227 9 

30 Leu Met Ser Val Pro Gly Lys He Ala Ala Arg Val Arg Ala Arg Arg 

625 630 635 

GCG CGC CGC CGC GCC GCT CGT GCC AAT TAGTTTGCTC GCTCCTGTTT 2326 
Ala Arg Arg Arg Ala Ala Arg Ala Asn 
35 640 645 
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CGCCGTTTCG TAAAACGGCG TGGTCCCGCA CATTACGCGT ACCCTAAAGA CTCTGGTGAG 2386 



TCCCCGTCGT TACACGACGG GTCTGCCGCG GTTCGATTCC ATTCCCAAGC GGCAAGAAGG 24 4 6 



AC GT AGTTAG CTCTGCGTCC CTCGGGATAC CA 24 78 



(2) INFORMATION FOR SEQ ID NO: 50: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 

{ii} MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

20 

Met Giy Asp Ala Gly Val Ala Ser Gin Arg Pro His Asn Arg Arg Gly 

- -1 - - . .5 10 15 



Thr Arg Asn Val Arg Val Ser Ala Asn Thr Val Thr Val Asn Gly Arg 
25 20 25 30 



Arg Asn Gin Arg Arg Arg Thr Gly Arg Gin Val Ser Pro Pro Asp Asn 
35 40 45 



30 Phe Thr Ala Ala Ala Gin Asp Leu Ala Gin Ser Leu Asp Ala Asn Thr 

50 55 60 



Val Thr Phe Pro Ala Asn He Ser Ser Met Pro Glu Phe Arg Asn Trp 
65 70 75 80 



35 



Ala Lys Gly Lys He Asp Leu Asp Ser Asp Ser He Gly Trp Tyr Phe 
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85 90 95 

Lys Tyr Leu Asp Pro Ala Gly Ala Thr Glu Ser Ala Arg Ala Val Gly 
100 105 110 

Glu Tyr Ser Lys lie Pro Asp Gly Leu Val Lys Phe Ser Val Asp Ala 
115 120 125 

Glu lie Arg Glu lie Tyr Asn Glu Glu Cys Pro Val Val Thr Asp Val 
130 135 140 

Ser Val Pro Leu Asp Gly Arg Gin Trp Ser Leu Ser lie Phe Ser Phe 
145 150 155 160 

Pro Met Phe Arg Thr Ala Tyr Val Ala Val Ala Asn Val Glu Asn Lys 
165 170 175 

Glu Met Ser Leu Asp Val Val Asn Asp Leu lie Glu Trp Leu Asn Asn 
180 185 190 

Leu Ala Asp Trp Arg Tyr Val Val Asp Ser Glu Gin Trp lie Asn Phe 
195 200 205 

Thr Asn Asp Thr Thr Tyr Tyr Val Arg lie Arg Val Leu Arg Pro Thr 
210 215 220 

Tyr Asp Val Pro Asp Pro Thr Glu Gly Leu Val Arg Thr Val Ser Asp 
225 230 235 240 

Tyr Arg Leu Thr Tyr Lys Ala lie Thr Cys Glu Ala Asn Met Pro Thr 
245 250 255 

Leu Val Asp Gin Gly Phe Trp lie Gly Gly Gin Tyr Ala Leu Thr Pro 
260 265 270 

Thr Ser Leu Pro Gin Tyr Asp Val Ser Glu Ala Tyr Ala Leu His Thr 
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275 280 285 

Leu Thr Phe Ala Arg Pro Ser Ser Ala Ala Ala Leu Ala Phe Val Trp 
290 295 300 

Ala Gly Leu Pro Gin Gly Gly Thr Ala Pro Ala Gly Thr Pro Ala Trp 
305 310 315 320 

Glu Gin Ala Ser Ser Gly Gly Tyr Leu Thr Trp Arg His Asn Gly Thr 
325 330 335 

Thr Phe Pro Ala Gly Ser Val Ser Tyr Val Leu Pro Glu Gly Phe Ala 
340 345 350 

Leu Glu Arg Tyr Asp Pro Asn Asp Gly Ser Trp Thr Asp Phe Ala Ser 
355 360 365 

Ala Gly Asp Thr Val Thr Phe Arg Gin Val Ala Val Asp Glu Val Val 
370 375 380 

Val Thr Asn Asn Pro Ala Gly Gly Gly Ser Ala Pro Thr Phe Thr Val 
385 . 390 395 400 

Arg Val Pro Pro Ser Asn Ala Tyr Thr Asn Thr Val Phe Arg Asn Thr 
405 410 415 

Leu Leu Glu Thr Arg Pro Ser Ser Arg Arg Leu Glu Leu Pro Met Pro 
420 425 430 

Pro Ala Asp Phe Gly Gin Thr Val Ala Asn Asn Pro Lys lie Glu Gin 
435 440 445 

Ser Leu Leu Lys Glu Thr Leu Gly Cys Tyr Leu Val His Ser Lys Met 
450 455 460 ' 

Arg Asn Pro Val Phe Gin Leu Thr Pro Ala Ser Ser Phe Gly Ala Val 
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465 470 475 480 

Ser Phe Asn Asn Pro Gly Tyr Glu Arg Thr Arg Asp Leu Pro Asp Tyr 
485 490 495 

Thr Gly lie Arg Asp Ser Phe Asp Gin Asn Met Ser Thr Ala Val Ala 
500 505 510 



His Phe Arg Ser 
515 

Gin Gly Trp Glu 

530 

Ala His Ala Gly 
545 

Asp Leu Ala Thr 

Ala Ala Ala Val 
580 

Lys Ser Glu Ala 
595 

Val Gly Ala Ala 
610 

Ser Val Pro Gly 
625 



Leu Ser His Ser 
520 

Gly Val Thr Asn 

535 

Leu Leu Lys Asn 
550 

Arg Leu Thr Gly 
565 

Ser Ala Phe Ala 

Thr Ser Ser lie 
600 

Gin Ser Gly Leu 
615 

Lys lie Ala Ala 
630 



Cys Ser lie Val 

Val Asn Thr Pro 
540 

Glu Glu lie Leu 
555 

Val Tyr Pro Ala 
570 

Ala Asn Met Leu 
585 

He Lys Ser Val 

Ala Lys Leu Pro 
620 

Arg Val Arg Ala 
635 



Thr Lys Thr Tyr 
525 

Phe Gly Gin Phe 

Cys Leu Ala Asp 
560 

Thr Asp Asn Phe 
575 

Ser Ser Val Leu 
590 

Gly Glu Thr Ala 
605 

Gly Leu Leu Met 

Arg Arg Ala Arg 
640 



Arg Arg Ala Ala Arg Ala Asn 
645 



a 

m 
sj 

M 
m 

m 
w 

= 

a 
a 

UJ 

o - 
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10 



15 



20 



25 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 51: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2479 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS : single 
{D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 283.. 2307 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 : 

GTTTTTCTTT CTTTACCAAG TGTGGTAAAA TTTAAACAAA GAAGAAAACC AGGACCGTAA 60 

CCCGGCCCTT ACACACCTCG AGTCC GTGAC CACCGGATTA TACGTCGCCC ACCACACGGC 120 

GCCTTTTCCG ACCACTCTCG AGAGTCGTTG GGAGTTTCGT CCGTGACCAC CCGGTTGGCA 18 0 

GTCGACAGAC GCTTCCGGAC C AC TAGAACC TCCTCGAGCG ACGCACACAC AGCACACACA 240 

CCGCCTTAGC TGCACCTACG GCAGCGTTGA TAGCGCGGAT TT ATG AGC GAG CAC 2 94 

Met Ser Glu His 
1 

ACC ATC GCC CAC TCC ATC ACA TTA CCA CCC GGT TAC ACC CTT GCC CTA 342 
Thr lie Ala His Ser lie Thr Leu Pro Pro Gly Tyr Thr Leu Ala Leu 
5 10 15 ' 20 

ATA CCC CCT GAA CCT GAA GCA GGA TGG GAG ATG CTG GAG TGG CGT CAC 390 
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lie Pro Pro Glu Pro Glu Ala Gly Trp Glu Met Leu Glu Trp Arg His 
25 30 35 

AGC GAC CTC ACA ACC GTC GCG GAA CCC GTA ACG TTC GGG TCA GCG CCA 4 38 

5 Ser Asp Leu Thr Thr Val Ala Glu Pro Val Thr Phe Gly Ser Ala Pro 

40 45 50 

ACA CCG TCA CCG TCA ATG GTA GAA GAA ACC AAC GGC GTC GGA CCG GAA 4 86 

Thr Pro Ser Pro Ser Met Val Glu Glu Thr Asn Gly Val Gly Pro Glu 
10 55 60 65 

GGC AAG TTT CTC CCC CTG ACA ATT TCA CCG CTG CTG CAC AAG ACC TCG 534 
Gly Lys Phe Leu Pro Leu Thr lie Ser Pro Leu Leu His Lys Thr Ser 
70 75 80 

15 

CGC AAA GCC TTG ACG CCA ACA CCG TCA CTT TCC CCC GCT AAC ATC TCT 582 
Arg Lys Ala Leu Thr Pro Thr Pro Ser Leu Ser Pro Ala Asn lie Ser 
85 90 95 100 

20 AGC ATG CCC GAA TTC CGG AAT TGG GCC AAG GGA AAG ATC GAC CTC GAC 630 

Ser Met Pro Glu Phe Arg Asn Trp Ala Lys Gly Lys lie Asp Leu Asp 
" " " ' 105 110 _ 115 

TCC GAT TCC ATC GGC TGG TAC TTC AAG TAC CTT GAC CCA GCG GGT GCT 67 8 

25 Ser Asp Ser lie Gly Trp Tyr Phe Lys Tyr Leu Asp Pro Ala Gly Ala 

120 125 130 

ACA GAG TCT GCG CGC GCC GTC GGC GAG TAC TCG AAG ATC CCT GAC GGC 72 6 

Thr Glu Ser Ala Arg Ala Val Gly Glu Tyr Ser Lys lie Pro Asp Gly 
30 135 140 145 

CTC GTC AAG TTC TCC GTC GAC GCA GAG ATA AGA GAG ATC TAT AAC GAG 774 
Leu Val Lys Phe Ser Val Asp Ala Glu lie Arg Glu lie Tyr Asn Glu 
150 155 160 

35 

GAG TGC CCC GTC GTC ACT GAC GTG TCC GTC CCC CTC GAC GGC CGC CAG 822 
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Glu Cys Pro Val Val Thr Asp Val Ser Val Pro Leu Asp Gly Arg Gin 
165 170 175 180 

TGG AGC CTC TCG ATT TTC TCC TTT CCG ATG TTC AGA ACC GCC TAC GTC 
5 Trp Ser Leu Ser lie Phe Ser Phe Pro Met Phe Arg Thr Ala Tyr Val 

185 190 195 

GCC GTA GCG AAC GTC GAG AAC AAG GAG ATG TCG CTC GAC GTT GTC AAC 
Ala Val Ala Asn Val Glu Asn Lys Glu Met Ser Leu Asp Val Val Asn 
10 200 205 210 

GAC CTC ATC GAG TGG CTC AAC AAT CTC GCC GAC TGG CGT TAT GTC GTT 

Asp Leu lie Glu Trp Leu Asn Asn Leu Ala Asp Trp Arg Tyr Val Val 
215 220 225 

15 

GAC TCT GAA CAG TGG ATT AAC TTC ACC AAT GAC ACC ACG TAC TAC GTC 

Asp Ser Glu Gin Trp lie Asn Phe Thr Asn Asp Thr Thr Tyr Tyr Val 
230 235 240 



20 



CGC ATC CGC GTT CTA CGT CCA ACC TAC GAC GTT CCA GAC CCC ACA GAG 
Arg lie Arg Val Leu Arg Pro Thr Tyr Asp Val Pro Asp Pro Thr Glu 
245 . _ 250 255 260 



1062 
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GGC CTT GTT CGC ACA GTC TCA GAC TAC CGC CTC ACT TAT AAG GCG ATA 1110 

Gly Leu Val Arg Thr Val Ser Asp Tyr Arg Leu Thr Tyr Lys Ala lie 

265 270 275 

5 

ACA TGT GAA GCC AAC ATG CCA ACA CTC GTC GAC CAA GGC TTT TGG ATC 1158 

Thr Cys Glu Ala Asn Met Pro Thr Leu Val Asp Gin Gly Phe Trp lie 

280 285 290 

10 GGC GGC CAG TAC GCT CTC ACC CCG ACT AGC CTA CCG CAG TAC GAC GTC 1206 

Gly Gly Gin Tyr Ala Leu Thr Pro Thr Ser Leu Pro Gin Tyr Asp Val 
295 300 305 



AGC GAG GCC TAC GCT CTG CAC ACT TTG ACC TTC GCC AGA CCA TCC AGC 1254 

/ 

15 Ser Glu Ala Tyr Ala Leu His Thr Leu Thr Phe Ala Arg Pro Ser Ser 

310 315 320 



GCC GCT GCA CTC GCG TTT GTG TGG GCA GGT TTG CCA CAG GGT GGC ACT 1302 

Ala Ala Ala Leu Ala Phe Val Trp Ala Gly Leu Pro Gin Gly Gly Thr 
20 325 330 335 340 

_GCG CCT GCA GGC ACT CCA GCC TGG GAG CAG GCA TCC TCG GGT GGC TAC 1350 

Ala Pro Ala Gly Thr Pro Ala Trp Glu Gin Ala Ser Ser Gly Gly Tyr 
345 350 355 



25 



30 



35 



CTC ACC TGG CGC CAC AAC GGT ACT ACT TTC CCA GCT GGC TCC GTT AGC 1398 

Leu Thr Trp Arg His Asn Gly Thr Thr Phe Pro Ala Gly Ser Val Ser 

360 365 370 

TAC GTT CTC CCT GAG GGT TTC GCC CTT GAG CGC TAC GAC CCG AAC GAC 1446 

Tyr Val Leu Pro Glu Gly Phe Ala Leu Glu Arg Tyr Asp Pro Asn Asp 
375 380 385 

GGC TCT TGG ACC GAC TTC GCT TCC GCA GGA GAC ACC GTC ACT TTC CGG 14 94 

Gly Ser Trp Thr Asp Phe Ala Ser Ala Gly Asp Thr Val Thr Phe Arg 

390 395 400 
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CAG GTC GCC GTC GAC GAG GTC GTT GTG ACC AAC AAC CCC GCC GGC GGC 
Gin Val Ala Val Asp Glu Val Val Val Thr Asn Asn Pro Ala Gly Gly 
405 410 415 420 

GGC AGC GCC CCC ACC TTC ACC GTG AGA GTG CCC CCT TCA AAC GCT TAC 
Gly Ser Ala Pro Thr Phe Thr Val Arg Val Pro Pro Ser Asn Ala Tyr 
425 430 435 

ACC AAC ACC GTG TTT AGG AAC ACG CTC TTA GAG ACT CGA CCC TCC TCT 
Thr Asn Thr Val Phe Arg Asn Thr Leu Leu Glu Thr Arg Pro Ser Ser 
440 445 450 

CGT AGG CTC GAA CTC CCT ATG CCA CCT GCT GAC TTT GGA CAG ACG GTC 
Arg Arg Leu Glu Leu Pro Met Pro Pro Ala Asp Phe Gly Gin Thr Val 
455 460 465 

GCC AAC AAC CCG AAG ATC GAG CAG TCG CTT CTT AAA GAA ACA CTT GGC 
Ala Asn Asn Pro Lys lie Glu Gin Ser Leu Leu Lys Glu Thr Leu Gly 
470 475 480 

TGC TAT TTG GTC CAC TCC AAA ATG CGA AAC CCC GTT TTC CAG CTC ACG 
Cys Tyr Leu Val His Ser Lys Met Arg Asn Pro Val Phe Gin Leu Thr 
485 490 495 500 

CCA GCC AGC TCC TTT GGC GCC GTT TCC TTC AAC AAT CCG GGT TAT GAG 
Pro Ala Ser Ser Phe Gly Ala Val Ser Phe Asn Asn Pro Gly Tyr Glu 
505 510 515 

CGC ACA CGC GAC CTC CCG GAC TAC ACT GGC ATC CGT GAC TCA TTC GAC 
Arg Thr Arg Asp Leu Pro Asp Tyr Thr Gly lie Arg Asp Ser Phe Asp 
520 525 530 

CAG AAC ATG TCC ACC GCT GTG GCC CAC TTC CGC TCA CTC TCC CAC TCC 
Gin Asn Met Ser Thr Ala Val Ala His Phe Arg Ser Leu Ser His Ser 
535 540 545 
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TGC AGT ATC GTC ACT AAG ACC TAC CAG GGT TGG GAA GGC GTC ACG AAC 197 4 

Cys Ser lie Val Thr Lys Thr Tyr Gin Gly Trp Glu Gly Val Thr Asn 
550 555 560 

5 GTC AAC ACG CCT TTC GGC CAA TTC GCG CAC GCG GGC CTC CTC AAG AAT 2022 

Val Asn Thr Pro Phe Gly Gin Phe Ala His Ala Gly Leu Leu Lys Asn 
565 570 575 580 

GAG GAG ATC CTC TGC CTC GCC GAC GAC CTG GCC ACC CGT CTC ACA GGT 207 0 

10 Glu Glu lie Leu Cys Leu Ala Asp Asp Leu Ala Thr Arg Leu Thr Gly 

585 590 595 

GTC TAC CCC GCC ACT GAC AAC TTC GCG GCC GCC GTT TCT GCC TTC GCC 2118 
Val Tyr Pro Ala Thr Asp Asn Phe Ala Ala Ala Val Ser Ala Phe Ala 
15 600 605 610 

GCG AAC ATG CTG TCC TCC GTG CTG AAG TCG GAG GCA ACG TCC TCC ATC 2166 

Ala Asn Met Leu Ser Ser Val Leu Lys Ser Glu Ala Thr Ser Ser He 

615 620 625 

20 

ATC AAG TCC GTT GGC GAG ACT GCC GTC GGC GCG GCT CAG TCC GGC CTC 2214 

He Lys Ser Val Gly Glu Thr Ala Val Gly Ala Ala Gin Ser Gly Leu 

630 635 640 

25 GCG AAG CTA CCC GGA CTG CTA ATG AGT GTA CCA GGG AAG ATT GCC GCG 2262 

Ala Lys Leu Pro Gly Leu Leu Met Ser Val Pro Gly Lys He Ala Ala 
645 650 655 660 

CGT GTC CGC GCG CGC CGA GCG CGC CGC CGC GCC GCT CGT GCC AAT 2307 
30 Arg Val Arg Ala Arg Arg Ala Arg Arg Arg Ala Ala Arg Ala Asn 

665 670 675 



35 



TAGTTTGCTC GCTCCTGTTT CGCCGTTTCG TAAAACGGCG TGGTCCCGCA CAT T ACGC GT 
ACCCTAAAGA CTCTGGTGAG TCCCCGTCGT TAC AC GAC GG GTCTGCCGCG GTTCGATTCC 



2367 
2427 
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ATTCCCAAGC GGCAAGAAGG ACGT AGT TAG CTCTGCGTCC C TCGGGAT AC CA 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



Met Ser Glu His 
1 

Thr Leu Ala Leu 
20 

Glu Trp Arg His 
35 

Gly Ser Ala Pro 
50 

Val Gly Pro Glu 
65 

His Lys Thr Ser 

Ala Asn lie Ser 
100 



Thr lie Ala His 
5 

lie Pro Pro Glu 

Ser Asp Leu Thr 
40 

Thr Pro Ser Pro 
55 

Gly Lys Phe Leu 
70 

Arg Lys Ala Leu 
85 

Ser Met Pro Glu 



Ser lie Thr Leu 
10 

Pro Glu Ala Gly 
25 

Thr Val Ala Glu 

Ser Met Val Glu 
60 

Pro Leu Thr lie 
75 

Thr Pro Thr Pro 
90 

Phe Arg Asn Trp 
105 



Pro Pro Gly Tyr 
15 

Trp Glu Met Leu 
30 

Pro Val Thr Phe 
45 

Glu Thr Asn Gly 

Ser Pro Leu Leu 
80 

Ser Leu Ser Pro 
95 

Ala Lys Gly Lys 
110 



223 



lie Asp Leu Asp Ser Asp Ser lie Gly Trp Tyr Phe Lys Tyr Leu Asp 
115 120 125 

Pro Ala Gly Ala Thr Glu Ser Ala Arg Ala Val Gly Glu Tyr Ser Lys 
130 135 140 

lie Pro Asp Gly Leu Val Lys Phe Ser Val Asp Ala Glu lie Arg Glu 
145 150 155 160 

lie Tyr Asn Glu Glu Cys Pro Val Val Thr Asp Val Ser Val Pro Leu 
165 170 175 

Asp Gly Arg Gin Trp Ser Leu Ser lie Phe Ser Phe Pro Met Phe Arg 
180 185 190 

Thr Ala Tyr Val Ala Val Ala Asn Val Glu Asn Lys Glu Met Ser Leu 
195 200 205 

Asp Val Val Asn Asp Leu lie Glu Trp Leu Asn Asn Leu Ala Asp Trp 
210 215 220 

Arg Tyr Val Val Asp Ser Glu Gin Trp lie Asn Phe Thr Asn Asp Thr 
225 230 235 240 

Thr Tyr Tyr Val Arg lie Arg Val Leu Arg Pro Thr Tyr Asp Val Pro 
245 250 255 

Asp Pro Thr Glu Gly Leu Val Arg Thr Val Ser Asp Tyr Arg Leu Thr 
260 265 270 

Tyr Lys Ala lie Thr Cys Glu Ala Asn Met Pro Thr Leu Val Asp Gin 
275 280 285 



Gly Phe Trp lie Gly Gly Gin Tyr Ala Leu Thr Pro Thr Ser Leu Pro 
290 295 300 
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Gin Tyr Asp Val Ser Glu Ala Tyr Ala Leu His Thr Leu Thr Phe Ala 
305 310 315 320 

Arg Pro Ser Ser Ala Ala Ala Leu Ala Phe Val Trp Ala Gly Leu Pro 
325 330 335 

Gin Gly Gly Thr Ala Pro Ala Gly Thr Pro Ala Trp Glu Gin Ala Ser 
340 345 350 

Ser Gly Gly Tyr Leu Thr Trp Arg His Asn Gly Thr Thr Phe Pro Ala 

355 360 365 

Gly Ser Val Ser Tyr Val Leu Pro Glu Gly Phe Ala Leu Glu Arg Tyr 
370 375 380 

Asp Pro Asn Asp Gly Ser Trp Thr Asp Phe Ala Ser Ala Gly Asp Thr 
3B5 390 395 400 

Val Thr Phe Arg Gin Val Ala Val Asp Glu Val Val Val Thr Asn Asn 
405 410 415 

-Pro Ala Gly Gly Gly Ser Ala Pro Thr Phe Thr Val Arg Val Pro Pro 
420 425 430 

Ser Asn Ala Tyr Thr Asn Thr Val Phe Arg Asn Thr Leu Leu Glu Thr 
435 440 445 

Arg Pro Ser Ser Arg Arg Leu Glu Leu Pro Met Pro Pro Ala Asp Phe 
450 455 460 

Gly Gin Thr Val Ala Asn Asn Pro Lys lie Glu Gin Ser Leu Leu Lys 
465 470 475 480 



Glu Thr Leu Gly Cys Tyr Leu Val His Ser Lys Met Arg Asn Pro Val 
485 490 495 
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Phe Gin Leu Thr Pro Ala Ser Ser Phe Gly Ala Val Ser Phe Asn Asn 
500 ' 505 510 

Pro Gly Tyr Glu Arg Thr Arg Asp Leu Pro Asp Tyr Thr Gly lie Arg 
515 520 525 

Asp Ser Phe Asp Gin Asn Met Ser Thr Ala Val Ala His Phe Arg Ser 
530 535 540 

Leu Ser His Ser Cys Ser lie Val Thr Lys Thr Tyr Gin Gly Trp Glu 
545 550 555 560 

Gly Val Thr Asn Val Asn Thr Pro Phe Gly Gin Phe Ala His Ala Gly 
565 570 575 

Leu Leu Lys Asn Glu Glu lie Leu Cys Leu Ala Asp Asp Leu Ala Thr 
580 585 590 

Arg Leu Thr Gly Val Tyr Pro Ala Thr Asp Asn Phe Ala Ala Ala Val 
595 600 605 

Ser Ala- Phe Ala Ala Asn Met Leu Ser Ser Val Leu Lys Ser Glu Ala 
610 615 620 

Thr Ser Ser lie lie Lys Ser Val Gly Glu Thr Ala Val Gly Ala Ala 
625 630 635 640 

Gin Ser Gly Leu Ala Lys Leu Pro Gly Leu Leu Met Ser Vai Pro Gly 
645 650 655 

Lys lie Ala Ala Arg Val Arg Ala Arg Arg Ala Arg Arg Arg Ala Ala 
660 665 670 

Arg Ala Asn 
675 



